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HELICOBACTER GHPQ 1360 AND GHPO 750 POLYPEPTIDES AND 
CORRESPONDING POLYNUCLEOTIDE MOLECULES 
The invention relates to Helicobacter antigens and corresponding 
polynucleotide molecules that can be used in methods to prevent or treat 
Helicobacter infection in mammals, such as humans. 



Background of the Invention 

Helicobacter is a genus of spiral, gram-negative bacteria that colonize 
the gastrointestinal tracts of mammals. Several species colonize the stomach, most 
notably H. pylori, H. heilmanii, Kfelis, and K mustelae. Although H. pylori is 
the species most commonly associated with human infection, K heilmanii and H. 
felis have also been isolated from humans, but at lower frequencies than H. pylori. 
Helicobacter infects over 50% of adult populations in developed countries and 
nearly 100% in developing countries and some Pacific rim countries, making it 
one of the most prevalent infections worldwide. 

Helicobacter is routinely recovered from gastric biopsies of humans 
with histological evidence of gastritis and peptic ulceration. Indeed, H. pylori is 
now recognized as an important pathogen of humans, in that the chronic gastritis it 
causes is a risk factor for the development of peptic ulcer diseases and gastric 
carcinoma. It is thus highly desirable to develop safe and effective vaccines for 
preventing and treating Helicobacter infection. 

A number of Helicobacter antigens have been characterized or isolated. 
These include urease, which is composed of two stmctural subunits of 



approximately 30 and 67 kDa (Hu et aL, Infect. Immun. 58:992, 1990; Dunn et al., 
J. Biol. Chem. 265:9464, 1990; Evans et aL, Microbial Pathogenesis 10:15, 1991; 
Labigne et al, J. Bact, 173:1920, 1991); the 87 kDa vacuolar cytotoxin (VacA) 
(Cover et al, J. Biol. Chem. 267:10570, 1992; Phadnis et al. Infect. Immun. 
5 62:1557, 1994; WO 93/18150); a 128 kDa immunodominant antigen associated 
with the cytotoxin (CagA, also called TagA; WO 93/18150; U.S. Patent No. 
5,403,924); 13 and 58 kDa heat shock proteins HspA and HspB (Suerbaum et al, 
Mol. Microbiol. 14:959, 1994; WO 93/18150); a 54 kDa catalase (Hazell et al, J. 
Gen. Microbiol.l37:57, 1991); a 15 kDa histidine-rich protein (Hpn) (Gilbert et 

10 al. Infect. Immim. 63:2682, 1995); a 20 kDa membrane-associated lipoprotein 
(Kostrcynska et al, J. Bact. 176:5938, 1994); a 30 kDa outer membrane protein 
(Bolin et al, J. Clin. Microbiol. 33:381, 1995); a lactoferrin receptor 
(FR 2,724,936); and several porins, designated HopA, HopB, HopC, HopD, and 
HopE, which have molecular weights of 48-67 kDa (Exner et al. Infect. Immun. 

15 63:1567, 1995; Doig et al, J. Bact. 177:5447, 1995). Some of these proteins have 
been proposed as potential vaccine antigens. In particular, urease is believed to be 
a vaccine candidate (WO 94/9823; WO 95/22987; WO 95/3824; Michetti etal. 
Gastroenterology 107:1002, 1994). Nevertheless, it is tiiought that several 
antigens may ultimately be necessary in a vaccine. 

20 Summary of the Invention 

The invention provides polynucleotide molecules that encode 
Helicobacter polypeptides, designated GHPO 1360 (32 kDa) and GHPO 750 (50 
kDa), which can be used, e.g., in methods to prevent, treat, or diagnose 
Helicobacter infection. The polypeptides include those having the amino acid 
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sequences shown in SEQ ID N0s:2 (GHPO 1360) and 4 (GHPO 750). Those 
skilled in the art will understand that the invention also includes polynucleotide 
molecules that encode mutants and derivatives of these polypeptides, which can 
result from the addition, deletion, or substitution of non-essential amino acids, as is 
5 described further below. 

In addition to the polynucleotide molecules described above, the 
invention includes the corresponding polypeptides {i.e., polypeptides encoded by 
the polynucleotide molecules of the invention, or fragments thereof), and 
monospecific antibodies that specifically bind to these polypeptides. 
1 0 The present invention has many applications and includes expression 

0 cassettes, vectors, and cells transformed or transfected with the polynucleotides of 
y the invention. Accordingly, the present invention provides (i) methods for 

il producing polypeptides of the invention in recombinant host systems and related 
I?I expression cassettes, vectors, and transformed or transfected cells; (ii) live vaccine 
15 vectors, such as pox virus, Salmonella typhimurium, and Vibrio cholerae vectors, 

1 that contain polynucleotides of the invention (such vaccine vectors being usefiil in, 
y ^.g., methods for preventing or treating Helicobacter infection) in combination 

fU with a diluent or carrier, and related pharmaceutical compositions and associated 
therapeutic and/or prophylactic methods; (iii) therapeutic and/or prophylactic 
2 0 methods involving administration of polynucleotide molecules, either in a naked 
form or formulated with a delivery vehicle, polypeptides or mixtures of 
polypeptides, or monospecific antibodies of the invention, and related 
pharmaceutical compositions; (iv) methods for detecting the presence of 
Helicobacter in biological samples, which can involve the use of polynucleotide 
2 5 molecules, monospecific antibodies, or polypeptides of the invention; and 
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(v) methods for purifying polypeptides of the invention by antibody-based affinity 
chromatography. 

Detailed Description 
Open reading frames (ORFs) encoding new, full length polypeptides, 
5 designated GHPO 1360 (32 kDa) and GHPO 750 (50 kDa), have been identified in 
the K pylori genome. These polypeptides can be used, for example, in 
vaccination methods for preventing or treating Helicobacter infection. GHPO 
1360 is a membrane-associated, secreted polypeptide that can be produced in its 
mature form {i.e., as a polypeptide that has been exported through class II or class 
O 1 0 III secretion pathways) or as a precursor that includes a signal peptide, which can 
y be removed in the course of excretion/secretion by cleavage at the N-terminal end 
1=^ of the mature form. (The cleavage site is located at the C-terminal end of the 

signal peptide, adjacent to the mature form.) The cleavage site for GHPO 1360 
"f% and, thus, the first amino acid of the mature GHPO 1360 polypeptide, was 

^15 putatively deterroined. 

^ According to a first aspect of the invention, there are provided isolated 

polynucleotides that encode the precursor and mature forms of Helicobacter 
GHPO 1360. These isolated polynucleotides encode (i) a polypeptide having an 
amino acid sequence that is homologous to a Helicobacter amino acid sequence of 
20 a polypeptide associated with the Helicobacter membrane, where the Helicobacter 
amino acid sequence is shown in SEQ ID N0:2, beginning with an amino acid in 
any one of the positions from -20 to 5, preferably in position -20 or position 1, and 
ending with an amino acid in position 279 (GHPO 1360); or (ii) a derivative of the 
polypeptide. 
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This aspect of the invention also includes isolated polynucleotides that 
encode Helicobacter GHPO 750. These polynucleotides encode (i) a polypeptide 
having an amino acid sequence that is homologous to a Helicobacter amino acid 
sequence of a polypeptide associated with the Helicobacter membrane, where the 

5 Helicobacter amino acid sequence is shown in SEQ ID N0:4, beginning with an 
amino acid in position 1 and ending with an amino acid in position 399 (GHPO 
750); or (ii) a derivative of the polypeptide. 

The term "isolated polynucleotide" is defined as a polynucleotide that is 
removed from the environment in which it naturally occurs. For example, a 

1 0 naturally-occurring DNA molecule present in the genome of a living bacteria or as 
part of a gene bank is not isolated, but the same molecule, separated from the 
remaining part of the bacterial genome, as a result of, e.g., a cloning event 
(amplification), is "isolated." Typically, an isolated DNA molecule is firee firom 
DNA regions (e.g., coding regions) with which it is immediately contiguous, at the 

15 5' or 3^ ends, in the naturally occurring genome. Such isolated polynucleotides can 
be part of a vector or a composition and still be isolated, as such a vector or 
composition is not part of its natural environment. 

A polynucleotide of the invention can consist of RNA or DNA (e.g., 
cDNA, genomic DNA, or synthetic DNA), or modifications or combinations of 

2 0 RNA or DNA. The polynucleotide can be double-stranded or single-stranded and, 
if single-stranded, can be the coding (sense) strand or the non-coding (anti-sense) 
strand. The sequences that encode polypeptides of the invention, as shown in SEQ 
ID N0s:2 and 4, can be (a) the coding sequence as shown in SEQ ID NOs:l and 3; 
(b) a ribonucleotide sequence derived by transcription of (a); or (c) a different 

2 5 coding sequence that, as a result of the redundancy or degeneracy of the genetic 



code, encodes the same polypeptides as the polynucleotide molecules having the 
sequences illustrated in SEQ ID NOs:l and 3, The polypeptide can be one that is 
naturally secreted or excreted by, e.g., H.felis, H. mustelae, H. heilmanii, ovH, 
pylori. 

By "polypeptide" or "protein" is meant any chain of amino acids, 
regardless of length or post-translational modification {e.g., glycosylation or 
phosphorylation). Both terms are used interchangeably in the present application. 

By "homologous amino acid sequence" is meant an amino acid sequence 
that differs jfrom an amino acid sequence shown in SEQ ID N0:2 or 4, or an amino 
acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1 or 3, by one or 
more non-conservative amino acid substitutions, deletions, or additions located at 
positions at which they do not destroy the specific antigenicity of the polypeptide. 
Preferably, such a sequence is at least 75%, more preferably at least 80%, and 
most preferably at least 90% identical to an amino acid sequence shown in SEQ ID 
N0s:2 or 4. 

Homologous amino acid sequences include sequences that are identical 
or substantially identical to an amino acid sequence as shown in SEQ ID N0s:2 or 
4. By "amino acid sequence that is substantially identical" is meant a sequence 
that is at least 90%, preferably at least 95%, more preferably at least 97%, and 
most preferably at least 99% identical to an amino acid sequence of reference and 
that differs from the sequence of reference, if at all, by a majority of conservative 
amino acid substitutions. 

Conservative amino acid substitutions typically include substitutions 
among amino acids of the same class. These classes include, for example, amino 
acids having uncharged polar side chains, such as asparagine, glutamine, serine. 



threonine, and tyrosine; amino acids having basic side chains, such as lysine, 
arginine, and histidine; amino acids having acidic side chains, such as aspartic acid 
and glutamic acid; and amino acids having nonpolar side chains, such as glycine, 
alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan, 
and cysteine. 

Homology can be measured using sequence analysis software (e.g.. 
Sequence Analysis Software Package of the Genetics Computer Group, University 
of Wisconsin Biotechnology Center, 1710 University Avenue, Madison, WI 
53705). Similar amino acid sequences are aligned to obtain the maximum degree 
of homology {i.e., identity). To this end, it may be necessary to artificially 
introduce gaps into the sequence. Once the optimal alignment has been set up, the 
degree of homology {i.e., identity) is established by recording all of the positions 
in which the amino acids of both sequences are identical, relative to the total 
number of positions. 

Homologous polynucleotide sequences are defined in a similar way. 
Preferably, a homologous sequence is one that is at least 45%, more preferably at 
least 60%, and most preferably at least 85% identical to a coding sequence of SEQ 
ID NOs:l or 3. 

Polypeptides having a sequence homologous to one of the sequences 
shown in SEQ ID N0s:2 and 4, include naturally-occurring allelic variants, as well 
as mutants or any other non-naturally occurring variants that are analogous in 
terms of antigenicity, to a polypeptide having a sequence as shown in SEQ ID 
N0s:2 and 4. 

As is known in the art, an allelic variant is an altemate form of a 
polypeptide that is characterized as having a substitution, deletion, or addition of 



one or more amino acids that does not alter the biological function of the 
polypeptide. By "biological function" is meant a function of the polypeptide in the 
cells in which it naturally occurs, even if the function is not necessary for the 
growth or survival of the cells. For example, the biological function of a porin is 
to allow the entry into cells of compounds present in the extracellular medium. 
The biological function is distinct from the antigenic function. A polypeptide can 
have more than one biological function. 

Allelic variants are very common in nature. For example, a bacterial 
species, e.g., K pylori, is usually represented by a variety of strains that differ 
from each other by minor allelic variations. Indeed, a polypeptide that fulfills the 
same biological function in different strains can have an amino acid sequence that 
is not identical in each of the strains. Such an allelic variation can be equally 
reflected at the polynucleotide level. 

Support for the use of allelic variants of polypeptide antigens comes 
from, e.g., studies of the Helicobacter urease antigen. The amino acid sequence of 
Helicobacter urease varies widely from species to species, yet cross-species 
protection occurs, indicating that the urease molecule, when used as an 
immunogen, is highly tolerant of amino acid variations. Even among different 
strains of the single species H pylori^ there are amino acid sequence variations. 

For example, although the amnio acid sequences of the UreA and UreB 
subunits oiK pylori and Kfelis ureases differ from one another by 26.5% and 
1 1.8%, respectively (Ferrero et al., Molecular Microbiology 9(2):323-333, 1993), 
it has been shown that H. pylori urease protects mice from Kfelis infection 
(Michetti et al. Gastroenterology 107:1002, 1994). In addition, it has been shown 
that the individual structural subunits of urease, UreA and UreB, which contain 




distinct amino acid sequences, are both protective antigens against Helicobacter 
infection (Michetti et al, supra). Similarly, Cuenca et al. (Gastroenterology 
11 0:1 770, 1996) showed that therapeutic immunization of K /wws'te/ae-infected 
ferrets with K pylori urease was effective at eradicating K mustelae infection. 
5 Further, several urease variants have been reported to be effective vaccine 

antigens, including, e.g., recombinant UreA -i- UreB apoenzyme expressed from 
pORV142 (UreA and UreB sequences derived from K pylori strain CPM630; Lee 
et al., J. Infect. Dis.l72:161, 1995); recombinant UreA + UreB apoenzyme 
expressed from pORV214 (UreA and UreB sequences differ from K pylori strain 

1 0 CPM630 by one and two amino acid changes, respectively; Lee et aL, supra, 
1995); a UreA-glutathione-S-transferase ftision protein (UreA sequence from 
H. pylori strain ATCC 43504; Thomas et al.. Acta Gastro-Enterologica Belgica 
56:54, 1993); UreA + UreB holoenzyme purified from K pylori strain 
NCTCl 1637 (Marchetti et al.. Science 267:1655, 1995); a UreA-MBP fiision 

15 protein (UreA from K pylori strain 85P; Ferrero et al. Infection and Immunity 
62:4981, 1994); a UreB-MBP fusion protein (UreB from K pylori strain 85P; 
Ferrero et al, supra)', a UreA-MBP fusion protein (UreA from H.felis strain 
ATCC 49179; Ferrero et al, supra); a UreB-MBP fusion protein (UreB from H. 
felis strain ATCC 49179; Ferrero et al., supra); and a 37 kDa fragment of UreB 

20 containing amino acids 220-569 (Dore-Davin et al, "A 37 kD fragment of UreB is 
sufficient to confer protection against Helicobacter felis infection in mice"). 
Finally, Thomas et al. (supra) showed that oral immunization of mice with crude 
sonicates of H. pylori protected mice from subsequent challenge with H.felis. 

Polynucleotides, e.g., DNA molecules, encoding allelic variants can 

2 5 easily be obtained by polymerase chain reaction (PCR) amplification of genomic 



bacterial DNA extracted by conventional methods. This involves the use of 
synthetic oligonucleotide primers matching sequences that are upstream and 
downstream of the 5' and 3' ends of the coding region. Suitable primers can be 
designed based on the nucleotide sequence information provided in SEQ ID NOs:l 
5 and 3. Typically, a primer consists of 10 to 40, preferably 15 to 25 nucleotides. It 
can also be advantageous to select primers containing C and G nucleotides in 
proportions sufficient to ensure efficient hybridization, e.g., an amount of C and G 
nucleotides of at least 40%, preferably 50%, of the total nucleotide amount. Those 
skilled in the art can readily design primers that can be used to isolate the 

1 0 polynucleotides of the invention from different Helicobacter strains. 

As an example, primers useful for cloning a DNA molecule encoding a 
polypeptide having the amino acid sequence of GHPO 1360 (SEQ ID N0:2), are 
shown in SEQ ID NO: 14 (matching at the 5' end) and in SEQ ID NO: 15 (matching 
at the 3' end). Use of these primers enables amplification of the entire gene 

15 encoding GHPO 1360. Primers having sequences shown in SEQ ID NO: 1 8 
(matching at the 5' end of the coding sequence corresponding to the mature 
protein) and SEQ ID NO: 15 (matching at the 3' end) can be used to amplify the 
portion of the gene encoding mature GHPO 1360. Experimental conditions for 
carrying out PGR can readily be determined by one skilled in the art and an 

2 0 illustration of carrying out PGR is provided in Example 3 . 

Thus, the first aspect of the invention includes (i) isolated 
polynucleotides (e.g., DNA molecules) that can be amplified and/or cloned using 
the polymerase chain reaction from a Helicobacter, e.g., H pylori, genome using a 
5' oligonucleotide primer having a sequence as shown in SEQ ID NO: 14, and a 3' 

25 oligonucleotide primer having a sequence as shown in SEQ ID NO: 15 (GHPO 
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1360); or a 5' oligonucleotide primer having a sequence as shown in SEQ ID 
NO: 16, and a 3' oligonucleotide primer having a sequence as shown in SEQ ID 
NO: 17 (GHPO 750); and (ii) isolated polynucleotide molecules encoding the 
mature form of the GHPO 1360 polypeptide encoded by a polynucleotide 
5 molecule amplified using the primers described above, or amplified by a 3' 
oligonucleotide primer having a sequence as shown in SEQ ID NO: 18 (GHPO 
1360, mature form), and a 3' oligonucleotide primer having a sequence in SEQ ID 
N0:15(GHP0 1360). 

The 5' ends of the primers having the nucleotide sequences of SEQ ID 
, ^ 10 NOs:14-18 can advantageously include a restriction endonuclease recognition site 
5 that contains, typically, 4 to 6 nucleotides. For example, the sequences 5'- 
W GGATCC-3' (BamHl) or 5'-CTCGAG-3' {Xhol) can be used. Restriction sites can 
H be selected by tiiose skilled in the art so that the amplified DNA, when digested, if 
, :j necessary, can be conveniently cloned into an appropriately digested vector, such 
p 15 as a plasmid vector. In addition, a 5' clamp can be included in the primers 5^ to the 
W restriction endonuclease recognition site. 



known methods for identifying regions of an antigen that are likely to be tolerant 
of amino acid sequence changes and/or deletions. For example, sequences of the 
2 0 antigen fi:om different species can be compared to identify conserved sequences. 



invention include, e.g., fragments, polypeptides having large intemal deletions 
derived from full-length polypeptides, and fusion proteins. Polypeptide fragments 
of the invention can be derived from a polypeptide having a sequence homologous 
25 to the sequences of SEQ ID N0s:2 or 4, to the extent that the firagments retain the 



o 



Useful homologs that do not occur naturally can be designed using 



Polypeptide derivatives that are encoded by polynucleotides of the 
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substantial antigenicity of the parent polypeptide (specific antigenicity). 
Polypeptide derivatives can also be constructed by large internal deletions that 
remove a substantial part of the parent polypeptide, while retaining specific 
antigenicity. Generally, polypeptide derivatives should be about at least 12 amino 
5 acids in length to maintain antigenicity. Advantageously, they can be at least 
20 amino acids, preferably at least 50 amino acids, more preferably at least 75 
amino acids, and most preferably at least 100 amino acids in length. 

Useful polypeptide derivatives, e.g., polypeptide fragments, can be 
designed using computer-assisted analysis of amino acid sequences in order to 

1 0 identify sites in protein antigens having potential as surface-exposed, antigenic 
regions (Hughes et al. Infect. Immun. 60(9):3497, 1992). For example, the Laser 
Gene Program from DNA Star can be used to obtain hydrophilicity, antigenic 
index, and intensity index plots for the polypeptides of the invention. This 
program can also be used to obtain information about homologies of the 

15 polypeptides with known protein motifs. One skilled in the art can readily use the 
information provided in such plots to select peptide fragments for use as vaccine 
antigens. For example, fragments spanning regions of the plots in which the 
antigenic index is relatively high can be selected. One can also select fragments 
spanning regions in which both the antigenic index and the intensity plots are 

2 0 relatively high. Fragments containing conserved sequences, particularly 
hydrophilic conserved sequences, can also be selected. 

Polypeptide fragments and polypeptides having large intemal deletions 
can be used for revealing epitopes that are otherwise masked in the parent 
polypeptide and that may be of importance for inducing a protective T cell- 
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dependent immune response. Deletions can also remove immunodominant regions 
of high variability among strains. 

It is an accepted practice in the field of immunology to use fragments 
and variants of protein immunogens as vaccines, as all that is required to induce an 
immune response to a protein is a small (e.g., 8 to 10 amino acids) immunogenic 
region of the protein. This has been done for a number of vaccines against 
pathogens other than Helicobacter. For example, short synthetic peptides 
corresponding to surface-exposed antigens of pathogens such as murine mammary 
tumor virus (peptide containing 1 1 amino acids; Dion et al. Virology 179:474- 
477, 1990), Semliki Forest vims (peptide containing 16 amino acids; Snijders et 
al.^ J. Gen. Virol, 72:557-565, 1991), and canine parvovirus (2 overlapping 
peptides, each containing 15 amino acids; Langeveld et al. Vaccine 12(15):1473- 
1480, 1994) have been shown to be effective vaccine antigens against their 
respective pathogens. 

Polynucleotides encoding polypeptide fragments and polypeptides 
having large intemal deletions can be constructed using standard methods (see, 
e.g., Ausubel et aL, Current Protocols in Molecular Biology, John Wiley & Sons 
Inc., 1994), for example, by PGR, including inverse PGR, by restriction enzyme 
treatment of the cloned DNA molecules, or by the method of Kunkel et al (Proc. 
Natl. Acad. Sci. USA 82:448, 1985; biological material available at Stratagene). 

A polypeptide derivative can also be produced as a fusion polypeptide 
that contains a polypeptide or a polypeptide derivative of the invention fused, e.g., 
at the N- or C-terminal end, to any other polypeptide (hereinafter referred to as a 
peptide tail). Such a product can be easily obtained by translation of a genetic 
fusion, i.e., a hybrid gene. Vectors for expressing ftision polypeptides are 
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commercially available, and include the pMal-c2 or pMal-p2 systems of New 
England Biolabs, in which the peptide tail is a maltose binding protein, the 
glutathione-S-transferase system of Pharmacia, or the His-Tag system available 
from Novagen. These and other expression systems provide convenient means for 
further purification of polypeptides and derivatives of the invention. 

Another particular example of fusion polypeptides included in invention 
includes a polj^eptide or polypeptide derivative of the invention fused to a 
polypeptide having adjuvant activity, such as, e.g., subunit B of either cholera 
toxin or E. coli heat-labile toxin. Several possibilities can be used for producing 
such fusion proteins. First, flie polypeptide of the invention can be fused to the 
N-terminal end or, preferably, to the C-terminal end of the polypeptide having 
adjuvant activity. Second, a polypeptide fragment of the invention can be fused 
within the amino acid sequence of the polypeptide having adjuvant activity. 
Spacer sequences can also be included, if desired. 

As stated above, the polynucleotides of the invention encode 
Helicobacter polypeptides in precursor or mature form. They can also encode 
hybrid precursors containing heterologous signal peptides, which can mature into 
polypeptides of the invention. By "heterologous signal peptide" is meant a signal 
peptide that is not found in the naturally-occurring precursor of a polypeptide of 
the invention. 

A polynucleotide of the invention hybridizes, preferably under stringent 
conditions, to a polynucleotide having a sequence as shown in SEQ ID NOs:l or 3. 
Hybridization procedures are, e.g., described by Ausubel et aL (supra); Silhavy et 
ah {Experiments with Gene Fusions, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, New York, 1984); and Davis et al. (A Manual for Genetic 
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Engineering: Advanced Bacterial Genetics, Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, New York, 1980). Important parameters that can be 
considered for optimizing hybridization conditions are reflected in the following 
formula, which facilitates calculation of the melting temperature (Tm), which is 
5 the temperature above which two complementary DNA strands separate from one 
another (Casey et al, Nucl. Acid Res. 4:1539, 1977): Tm = 81 .5 + 0.5 x (% G+C) 
+ 1 .6 log (positive ion concentration) - 0.6 x (% formamide). Under appropriate 
stringency conditions, hybridization temperature (Th) is approximately 20 to 
40°C, 20 to 25 °C, or, preferably, 30 to 40°C below the calculated Tm. Those 
1 0 skilled in the art will understand that optimal temperature and salt conditions can 
C be readily determined empirically in preliminary experiments using conventional 
~; procedures. For example, stringent conditions can be achieved, both for pre- 
^ hybridizing and hybridizing incubations, (i) within 4-16 hours at 42 "C, in 6 x SSC 

containing 50% formamide or (ii) within 4-16 hours at 65 °C in an aqueous 6 x 
1 15 SSC solution (1 M NaCl, 0. 1 M sodium citrate (pH 7.0)). For polynucleotides 
:;■ containing 30 to 600 nucleotides, the above formula is used and then is corrected 
, - by subtracting (600/polynucleotide size in base pairs). Stringency conditions are 
fi defmed by a Th that is 5 to 1 0 °C below Tm. 

Hybridization conditions with oligonucleotides shorter than 20-30 bases 
20 do not precisely follow the rules set forth above. In such cases, the formula for 
calculating the Tm is as follows: Tm = 4 x (G+C) + 2 (A+T). For example, an 
18 nucleotide fragment of 50% G+C would have an approximate Tm of 54°C. 

A polynucleotide molecule of the invention, containing RNA, DNA, or 
modifications or combinations thereof, can have various applications. For 
2 5 example, a polynucleotide molecule can be used (i) in a process for producing the 
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encoded polypeptide in a recombinant host system, (ii) in the constraction of 
vaccine vectors such as poxviruses, which are further used in methods and 
compositions for preventing and/or treating Helicobacter infection, (iii) as a 
vaccine agent, in a naked form or formulated with a delivery vehicle and, (iv) in 
5 the construction of attenuated Helicobacter strains that can over-express a 
polynucleotide of the invention or express it in a non-toxic, mutated form. 

According to a second aspect of the invention, there is therefore 
provided (i) an expression cassette containing a polynucleotide molecule of the 
invention placed under the control of elements {e.g., a promoter) required for 

10 expression; (ii) an expression vector containing an expression cassette of the 

invention; (iii) a procaryotic or eucaryotic cell transformed or transfected with an 
expression cassette and/or vector of the invention, as well as (iv) a process for 
producing a polypeptide or polypeptide derivative encoded by a polynucleotide of 
the invention, which involves culturing a procaryotic or eucaryotic cell 

15 transformed or transfected with an expression cassette and/or vector of the 

invention, under conditions that allow expression of the polynucleotide molecule 
of the invention and, recovering the encoded polypeptide or polypeptide derivative 
from the cell culture. 

A recombinant expression system can be selected from procaryotic and 

2 0 eucaryotic hosts. Eucaryotic hosts include, for example, yeast cells (e.g., 

Saccharomyces cerevisiae or Pichia Pastoris), mammalian cells {e.g., COSl, 
NIH3T3, or JEG3 cells), arthropods cells (e.g., Spodopterafrugiperda (SF9) cells), 
and plant cells. Preferably, a procaryotic host such as E. coli is used. Bacterial 
and eucaryotic cells are available from a number of different sources that are 
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known to those skilled in the art, e.g., the American Type Culture Collection 
(ATCC; Rockville, Maryland). 

The choice of the expression cassette will depend on the host system 
selected, as well as the features desired for the expressed polypeptide. For 
5 example, it may be useful to produce a polypeptide of the invention in a particular 
lipidated form or any other form. Typically, an expression cassette includes a 
constitutive or inducible promoter that is functional in the selected host system; a 
ribosome binding site; a start codon (ATG); if necessary, a region encoding a 
signal peptide, e.g., a lipidation signal peptide; a polynucleotide molecule of the 
^10 invention; a stop codon; and, optionally, a 3' terminal region (translation and/or 

□ transcription terminator). The signal peptide-encoding region is adjacent to the 

O 

U polynucleotide of the invention and is placed in the proper reading frame. The 

U signal peptide-encoding region can be homologous or heterologous to the 

y polynucleotide molecule encoding the mature polypeptide and it can be specific to 

□ 15 the secretion apparatus of the host used for expression. The open reading frame 

g constituted by the polynucleotide molecule of the invention, alone or together with 
the signal peptide, is placed under the control of the promoter so that transcription 

nJ and translation occur in the host system. Promoters and signal peptide-encoding 
regions are widely known and available to those skilled in the art and include, for 
2 0 example, the promoter of Salmonella typhimurium (and derivatives) that is 
inducible by arabinose (promoter araB) and is functional in Gram-negative 
bacteria such as E. coli (U.S. Patent No. 5,028,530; Cagnon et al.. Protein 
Engineering 4(7):843, 1991); the promoter of the bacteriophage T7 RNA 
polymerase gene, which is functional in a number of E, coli strains expressing T7 
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polymerase (U.S. Patent No. 4,952,496); the OspA lipidation signal peptide; and 
RlpB lipidation signal peptide (Takase etal, J. Bact. 169:5692, 1987). 

The expression cassette is typically part of an expression vector, which 
is selected for its ability to replicate in the chosen expression system. Expression 
5 vectors {e.g., plasmids or viral vectors) can be chosen from, for example, those 
described in Pouwels et al. {Cloning Vectors: A Laboratory Manual, 1985, Supp. 
1987) and can purchased from various commercial sources. Methods for 
transforming or transfecting host cells with expression vectors are well known in 
the art and will depend on the host system selected, as described in Ausubel et al. 
10 {supra). 

Upon expression, a recombinant polypeptide of the invention (or a 
polypeptide derivative) is produced and remains in the intracellular compartment, 
is secreted/excreted in the extracellular mediimi or in the periplasmic space, or is 
embedded in the cellular membrane. The polypeptide can then be recovered in a 

1 5 substantially purified form from the cell extract or from the supernatant after 

centrifixgation of the cell culture. Typically, the recombinant polypeptide can be 
purified by antibody-based affinity purification or by any other method known to a 
person skilled in the art, such as by genetic fiision to a small affinity-binding 
domain. Antibody-based affinity purification metfiods are also available for 

2 0 purifying a polypeptide of the invention extracted from a Helicobacter strain. 
Antibodies useful for mimunoafiinity purification of the polypeptides of the 
invention can be obtained using methods described below. 

Polynucleotides of the invention can also be used in DNA vaccination 
methods, using either a viral or bacterial host as gene delivery vehicle (live vaccine 

2 5 vector) or administering the gene in a free form, e.g., inserted into a plasmid. 
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Therapeutic or prophylactic efficacy of a polynucleotide of the invention can be 
evaluated as is described below. 

Accordingly, in a third aspect of the invention, there is provided (i) a 
vaccine vector such as a poxvirus, containing a polynucleotide molecule of the 
5 invention placed under the control of elements required for expression; (ii) a 

composition of matter containing a vaccine vector of the invention, together with a 
diluent or carrier; (iii) a pharmaceutical composition containing a therapeutically 
or prophylactically effective amount of a vaccine vector of the invention; (iv) a 
method for inducing an immune response against Helicobacter in a mammal (e.g., 

10 a human; alternatively, the method can be used in veterinary appUcations for 

treating or preventing Helicobacter infection of animals, e.g., cats or birds), which 
involves administering to the mammal an immunogenically effective amount of a 
vaccine vector of the invention to elicit an immune response, e.g., a protective or 
therapeutic immune response to Helicobacter, and (v) a method for preventing 

1 5 and/or treating a Helicobacter (e.g., H pylori, H. felis, H. mustelae, or H. 

heilmanii) infection, which involves administering a prophylactic or therapeutic 
amount of a vaccine vector of the invention to an individual in need. Additionally, 
the third aspect of the invention encompasses the use of a vaccine vector of the 
invention in the preparation of a medicament for preventing and/or treating 

20 Helicobacter infectioa. 

A vaccine vector of the invention can express one or several 
polypeptides or derivatives of the invention, as well as at least one additional 
Helicobacter antigen such as a urease apoenzyme or a subunit, fragment, homolog, 
mutant, or derivative thereof. In addition, it can express a cytokine, such as 

2 5 interleukin-2 (IL-2) or interleukin- 12 (IL- 1 2), that enhances the immime response. 
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Thus, a vaccine vector can include an additional polynucleotide molecules 
encoding, e.g., urease subunit A, B, or both, or a cytokine, placed under the control 
of elements required for expression in a mammalian cell 

Altematively, a composition of the invention can include several 
vaccine vectors, each of which being capable of expressing a polypeptide or 
derivative of the invention. A composition can also contain a vaccine vector 
capable of expressing an additional Helicobacter antigen such as urease 
apoenzyme, a subunit, fragment, homolog, mutant, or derivative thereof, or a 
cytokine such as IL-2 or IL-12. 

In vaccination methods for treating or preventing infection in a 
mammal, a vaccine vector of the invention can be administered by any 
conventional route in use in the vaccine field, for example, to a mucosal {e.g., 
ocular, intranasal, oral, gastric, pulmonary, intestinal, rectal, vaginal, or urinary 
tract) surface or via a parenteral (e.g., subcutaneous, intradermal, intramuscular, 
intravenous, or intraperitoneal) route. Preferred routes depend upon the choice of 
the vaccine vector. The administration can be achieved in a single dose or 
repeated at intervals. The appropriate dosage depends on various parameters that 
are understood by those skilled in the art, such as the nature of the vaccine vector 
itself, the route of administration, and the condition of the mammal to be 
vaccinated (e.g., the weight, age, and general health of the mammal). 

Live vaccine vectors that can be used in the invention include viral 
vectors, such as adenoviruses and poxviruses, as well as bacterial vectors, e.g., 
Shigella, Salmonella, Vibrio cholerae, Lactobacillus, Bacille bilie de Calmette- 
Guerin (BCG), and Streptococcus. An example of an adenovirus vector, as well as 
a method for constructing an adenovirus vector capable of expressing a 



polynucleotide molecule of the invention, is described in U.S. Patent No. 
4,920,209. Poxvirus vectors that can be used in the invention include, e.g., 
vaccinia and canary pox viruses, which are described in U.S. Patent No. 4,722,848 
and U.S. Patent No. 5,364,773, respectively (also see, e.g., Tartaglia et ah. 
Virology 188:217, 1992, for a description of a vaccinia virus vector, and Taylor et 
al. Vaccine 13:539, 1995, for a description of a canary poxvirus vector). Poxvirus 
vectors capable of expressing a polynucleotide of the invention can be obtained by 
homologous recombination, as described in Kieny et al. (Nature 312:163, 1984) so 
that the polynucleotide of the invention is inserted in the viral genome under 
appropriate conditions for expression in mammalian cells. Generally, the dose of 
viral vector vaccine, for therapeutic or prophylactic use, can be from about 1x10* 
to about 1x10", advantageously from about 1x10' to about 1x10^^ or, preferably, 
from about 1x10' to about 1x10' plaque-forming units per kilogram. Preferably, 
viral vectors are administered parenterally, for example, in 3 doses that are 4 
weeks apart. Those skilled in the art will recognize that it is preferable to avoid 
adding a chemical adjuvant to a composition containing a viral vector of the 
invention and tiiereby minimizing the immune response to the viral vector itself, 

Non-toxicogenic Vibrio cholerae mutant strains that can be used in live 
oral vaccines are described by Mekalanos et al. (Nature 306:551, 1983) and in 
U.S. Patent No. 4,882,278 (strain in which a substantial amoimt of the coding 
sequence of each of the two ctxA alleles has been deleted so that no fiinctional 
cholerae toxin is produced); WO 92/1 1354 (strain in which the irgA locus is 
inactivated by mutation; this mutation can be combined in a single strain with ctxA 
mutations); and WO 94/1533 (deletion mutant lacking functional ctxA and attRSl 
DNA sequences). These strains can be genetically engineered to express 



heterologous antigens, as described in WO 94/19482. An effective vaccine dose of 
a V. cholerae strain capable of expressing a polypeptide or polypeptide derivative 
encoded by a polynucleotide molecule of the invention can contain, e.g., about 
1x10^ to about 1x10', preferably about 1x10^ to about 1x10* viable bacteria in an 
appropriate volume for the selected route of administration. Preferred routes of 
administration include all mucosal routes, but, most preferably, these vectors are 
administered intranasally or orally. 

Attenuated Salmonella typhimurium strains, genetically engineered for 
recombinant expression of heterologous antigens, and their use as oral vaccines, 
are described by Nakayama et al. (Bio/Technology 6:693, 1988) and in WO 
92/1 1361. Preferred routes of administration for these vectors include all mucosal 
routes. Most preferably, the vectors are administered intranasally or orally. 

Others bacterial strains useful as vaccine vectors are described by High 
etal (EMBO 11:1991, 1992) and Sizemore etal. (Science 270:299, 1995; Shigella 
flexneri); Medaglini et al. (Proc. Natl. Acad. Sci. USA 92:6868, 1995; 
(Streptococcus gordonii); Flynn (Cell. Mol. Biol. 40 (suppl. I):3 1, 1 194), and in 
WO 88/6626, WO 90/0594, WO 91/13157, WO 92/1796, and WO 92/21376 
(Bacille Calmette Guerin). In bacterial vectors, a polynucleotide of the invention 
can be inserted into the bacterial genome or it can remain in a free state, for 
example, carried on a plasmid. 

An adjuvant can also be added to a composition containing a bacterial 
vector vaccine. A number of adjuvants that can be used are known to those skilled 
in the art. For exMtiple, preferred adjuvants can be selected from the list provided 
below. 
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According to a fourth aspect of the invention, there is also provided (i) a 
composition of matter containing a polynucleotide of the invention, together with a 
diluent or carrier; (ii) a pharmaceutical composition containing a therapeutically or 
prophylactically effective amount of a polynucleotide of the invention; (iii) a 
method for inducing an immune response against Helicobacter, in a mammal, by 
administering to the mammal an immunogenically effective amount of a 
polynucleotide of the invention to elicit an immune response, e.g., a protective 
immune response to Helicobacter, and (iv) a method for preventing and/or treating 
Helicobacter {e.g., K pylori, Kfelis, H. mustelae, otH. heilmanii) infection, by 
administering a prophylactic or therapeutic amount of a polynucleotide of the 
invention to an individual in need of such treatment. Additionally, the fourth 
aspect of the invention encompasses the use of a polynucleotide of the invention in 
the preparation of a medicament for preventing and/or treating Helicobacter 
infection. The fourth aspect of the invention preferably includes the use of a 
polynucleotide molecule placed under conditions for expression in a mammalian 
cell, e.g., in a plasmid that is unable to repHcate in mammalian cells and to 
substantially integrate into a mammalian genome. 

Polynucleotides (for example, DNA or RNA molecules) of the invention 
can also be administered as such to a mammal as a vaccine. When a DNA 
molecule of the invention is used, it can be in the form of a plasmid that is unable 
to replicate in a mammalian cell and unable to integrate into the mammalian 
genome. Typically, a DNA molecule is placed under the control of a promoter 
suitable for expression in a mammalian cell. The promoter can function 
ubiquitously or tissue-specifically. Examples of non-tissue specific promoters 
include the early Cytomegalovirus (CMV) promoter (U.S. Patent No. 4,168,062) 
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and the Rous Sarcoma Virus promoter (Norton et al, Molec. Cell Biol. 5:281, 
1985). The desmin promoter (Li et al. Gene 78:243, 1989; Li et al, J. Biol. 
Chem. 266:6562, 1991; Li etal, J, Biol. Chem. 268:10403, 1993) is tissue-specific 
and drives expression in muscle cells. More generally, useful promoters and 
vectors are described, e.g., in WO 94/21797 and by Hartikka et al (Human Gene 
Therapy 7:1205, 1996). 

For DNA/RNA vaccination, the polynucleotide of the invention can 
encode a precursor or a mature form of a polypeptide of the invention. When it 
encodes a precursor form, the precursor sequence can be homologous or 
heterologous. In the latter case, a eucaryotic leader sequence can be used, such as 
the leader sequence of the tissue-type plasminogen factor (tPA). 

A composition of the invention can contain one or several 
polynucleotides of the invention. It can also contain at least one additional 
polynucleotide encoding dinoih^r Helicobacter antigen, such as urease subunit A, 
B, or both, or a firagment, derivative, mutant, or analog thereof. A polynucleotide 
encoding a cytokine, such as interleukin-2 (IL-2) or interleukin-12 (IL-12), can 
also be added to the composition so that the innmune response is enhanced. These 
additional polynucleotides are placed under appropriate control for expression. 
Advantageously, DNA molecules of the invention and/or additional DNA 
molecules to be included in the same composition are carried in the same plasmid. 

Standard methods can be used in the preparation of therapeutic 
polynucleotides of the invention. For example, a polynucleotide can be used in a 
naked form, i&ee of any delivery vehicles, such as anionic liposomes, cationic 
lipids, microparticles, e.g., gold microparticles, precipitating agents, e.g., calcium 
phosphate, or any other transfection-facilitating agent. In this case, the 
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polynucleotide can be simply diluted in a physiologically acceptable solution, such 
as sterile saline or sterile buffered saline, with or without a carrier. When present, 
the carrier preferably is isotonic, hypotonic, or weakly hypertonic, and has a 
relatively low ionic strength, such as provided by a sucrose solution, e.g., a 
5 solution containing 20% sucrose. 

Alternatively, a polynucleotide can be associated with agents that assist 
in cellular uptake. It can be, e.g., (i) complemented with a chemical agent that 
modifies cellular permeability, such as bupivacaine (see, e.g., WO 94/16737), (ii) 
encapsulated into liposomes, or (iii) associated with cationic lipids or silica, gold, 

10 or tungsten microparticles. 

Anionic and neutral liposomes are well-known in the art (see, e.g., 
Liposomes: A Practical Approach, RPC New Ed, IRL Press, 1990, for a detailed 
description of methods for making liposomes) and are useful for delivering a large 
range of products, including polynucleotides. 

15 Cationic lipids can also be used for gene delivery. Such lipids include, 

for example, Lipofectin™, which is also known as DOTMA (N-[l-(2,3- 
dioleyloxy)propyl]-N,N,N-trimethylammonium chloride), DOTAP (1,2- 
bis(oleyloxy)-3 -(trimethylammonio)propane) , DD AB 
(dimethyldioctadecylammonium bromide), DOGS (dioctadecylamidologlycyl 

20 spermine), and cholesterol derivatives. A description of these cationic lipids can 
be found in EP 187,702, WO 90/1 1092, U.S. Patent No. 5,283,185, WO 91/15501, 
WO 95/26356, and U.S. Patent No. 5,527,928. Cationic lipids for gene delivery 
are preferably used in association with a neutral lipid such as DOPE (dioleyl 
phosphatidylethanolamine; WO 90/1 1092). Other transfection-facilitating 

25 compounds can be added to a formulation containing cationic liposomes. A 
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number of them are described in, e.g., WO 93/18759, WO 93/19768, WO 
94/25608, and WO 95/2397. They include, e.g., spermine derivatives useful for 
facilitating the transport of DNA through the nuclear membrane (see, for example, 
WO 93/18759) and membrane-permeabilizing compounds such as GALA, 
5 Gramicidine S, and cationic bile salts (see, for example, WO 93/19768). 

Gold or tungsten microparticles can also be used for gene delivery, as 
described in WO 91/359, WO 93/17706, and by Tang et al (Nature 356:152, 
1992). In this case, the microparticle-coated polynucleotides can be injected via 
intradermal or intraepidermal routes using a needleless injection device ("gene 
1 0 gun"), such as those described in U.S. Patent No. 4,945,050, U.S. Patent 
H No. 5,01 5,580, and WO 94/24263. 

The amoimt of DNA to be used in a vaccine recipient depends, e.g., on 
{I the strength of the promoter used in the DNA construct, the immunogenicity of the 
^ ■ expressed gene product, the condition of the mammal intended for administration 
15 (e.g., the weight, age, and general health of the mammal), the mode of 

administration, and the type of formulation. In general, a therapeutically or 
prophylactically effective dose from about 1 //g to about 1 mg, preferably, from 
ly about 10 Aig to about 800 /^g, and, more preferably, from about 25 /^g to about 250 
Aig, can be administered to human adults. The administration can be achieved in a 
2 0 single dose or repeated at intervals. 

The route of administration can be any conventional route used in the 
vaccine field. As general guidance, a polynucleotide of the invention can be 
administered via a mucosal surface, e.g., an ocular, intranasal, pulmonary, oral, 
intestinal, rectal, vaginal, or urinary tract surface, or via a parenteral route, e.g., by 
25 an intravenous, subcutaneous, intraperitoneal, intradermal, intraepidermal, or 
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intramuscular route. The choice of administration route will depend on, e.g., the 
formulation that is selected. A polynucleotide formulated in association with 
bupivacaine is advantageously administered into muscle. When a neutral or 
anionic liposome or a cationic lipid, such as DOTMA, is used, the formulation can 
be advantageously injected via intravenous, intranasal (for example, by 
aerosolization), intramuscular, intradermal, and subcutaneous routes. A 
polynucleotide in a naked form can advantageously be administered via the 
intramuscular, intradermal, or subcutaneous routes. Although not absolutely 
required, such a composition can also contain an adjuvant. A systemic adjuvant 
that does not require concomitant administration in order to exhibit an adjuvant 
effect is preferable. 

The sequence information provided in the present application enables 
the design of specific nucleotide probes and primers that can be used in diagnostic 
methods. Accordingly, in a fifth aspect of tiie invention, there is provided a 
nucleotide probe or primer having a sequence found in, or derived by degeneracy 
of the genetic code from, a sequence shown in SEQ ID NOs:l or 3, or a 
complementary sequence thereof. 

The term "probe" as used in tiie present application refers to DNA 
(preferably single stranded) or RNA molecules (or modifications or combinations 
thereof) that hybridize under the stiingent conditions, as defined above, to 
polynucleotide molecules having sequences homologous to those shown in SEQ 
ID NOs:l and 3, or to a complementary or anti-sense sequence or SEQ ID NOs:l 
and 3. Generally, probes are significantiy shorter than the full-length sequences 
shown in SEQ ID NOs:l and 3. For example, they can contain from about 5 to 
about 100, preferably from about 10 to about 80 nucleotides. In particular, probes 



have sequences that are at least 75%, preferably at least 85%, more preferably 95% 
homologous to a portion of a sequence as shown in SEQ ID NOs:l and 3 or a 
sequence complementary to such sequences. 

Probes can contain modified bases, such as inosine, metiiyl-5- 
deoxycytidine, deoxyuridine, dimethylamino-5-deoxyuridine, or diamino-2, 6- 
purine. Sugar or phosphate residues can also be modified or substituted. For 
example, a deoxyribose residue can be replaced by a polyamide (Nielsen et al. 
Science 254:1497, 1991) and phosphate residues can be replaced by ester groups 
such as diphosphate, alkyl, arylphosphonate, and phosphorotMoate esters. In 
addition, the 2'-hydroxyl group on ribonucleotides can be modified by addition of, 
e.g., alkyl groups. 

Probes of the invention can be used in diagnostic tests, or as capture or 
detection probes. Such capture probes can be immobilized on solid supports, 
directly or indirectly, by covalent means or by passive adsorption. A detection 
probe can be labeled by a detectable label, for example a label selected fi-om 
radioactive isotopes; enzymes, such as peroxidase and alkaline phosphatase; 
enzymes that are able to hydrolyze a chromogenic, fluorogenic, or luminescent 
substrate; compounds that are chromogenic, fluorogenic, or luminescent; 
nucleotide base analogs; and biotin. 

Probes of the invention can be used in any conventional hybridization 
method, such as in dot blot methods (Maniatis et al.. Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
New York, 1982), Southern blot methods (Southern, J. Mol. Biol. 98:503, 1975), 
northern blot methods (identical to Southern blot to the exception tiiat RNA is used 
as a target), or a sandwich method (Dunn et al. Cell 12:23, 1977). As is known in 
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the art, the latter technique involves the use of a specific capture probe and a 
specific detection probe that have nucleotide sequences that are at least partially 
different from each other. 

Primers used in the invention usually contain about 10 to 40 nucleotides 
and are used to initiate enzymatic polymerization of DNA in an amplification 
process (e.g., PGR), an elongation process, or a reverse transcription method. In a 
diagnostic method involving PGR, the primers can be labeled. 

Thus, the invention also encompasses (i) a reagent containing a probe of 
the invention for detecting and/or identifying the presence of Helicobacter in a 
biological material; (ii) a method for detecting and/or identifying the presence of 
Helicobacter in a biological material, in which (a) a sample is recovered or derived 
from the biological material, (b) DNA or RNA is extracted from the material and 
denatured, and (c) the sample is exposed to a probe of the invention, for example, a 
capture probe, a detection probe, or both, under stringent hybridization conditions, 
so that hybridization is detected; and (iii) a method for detecting and/or identifying 
the presence of Helicobacter in a biological material, in which (a) a sample is 
recovered or derived from the biological material, (b) DNA is extracted therefrom, 
(c) the extracted DNA is contacted with at least one, or, preferably two, primers of 
the invention, and amplified by the polymerase chain reaction, and (d) an 
amplified DNA molecule is produced. 

As mentioned above, polypeptides that can be produced by expression 
of the polynucleotides of the invention can be used as vaccine antigens. 
Accordingly, a sixth aspect of the invention features a substantially purified 
polypeptide or polypeptide derivative having an amino acid sequence encoded by a 
polynucleotide of the invention. 



A "substantially purified polypeptide" is defined as a polypeptide that is 
separated from the environment in which it naturally occurs and/or a polypeptide 
that is free of most of the other polypeptides that are present m the environment in 
which it was synthesized. The polypeptides of the invention can be purified from 
a natural source, such as a Helicobacter strain, or can be produced using 
recombinant methods. 

Homologous polypeptides or polypeptide derivatives encoded by 
polynucleotides of the invention can be screened for specific antigenicity by 
testing cross-reactivity with an antiserum raised against a polypeptide having an 
amino acid sequence as shown in SEQ ID N0s:2 or 4. Briefly, a monospecific 
hyperimmune antiserum can be raised against a purified reference polypeptide as 
such or as a fiision polypeptide, for example, an expression product of MBP, GST, 
or His-tag systems, or a synthetic peptide predicted to be antigenic. The 
homologous polypeptide or derivative that is screened for specific antigenicity can 
be produced as such or as a fiision polypeptide. In the latter case, and if the 
antiserum is also raised against a fiision polypeptide, two different fusion systems 
are employed. Specific antigenicity can be determined using a number of 
methods, including Western blot (Towbin et al, Proc. Natl. Acad. Sci. USA 
76:4350, 1979), dot blot, and ELISA methods, as described below. 

In a Western blot assay, the product to be screened, either as a purified 
preparation or a total E. coli extract, is fractionated by SDS-PAGE, as described, 
for example, by Laemmli (Nature 227:680, 1970). After being transferred to a 
filter, such as a nitrocellulose membrane, the material is incubated with the 
monospecific hyperimmune antiserum, which is diluted in a range of dilutions 
from about 1:50 to about 1:5000, preferably from about 1:100 to about 1:500. 



Specific antigenicity is shown once a band corresponding to flie product exhibits 
reactivity at any of the dilutions in the range. 

In an ELISA assay, the product to be screened can be used as the 
coating antigen. A purified preparation is preferred, but a whole cell extract can 
also be used. Briefly, about 100 jul of a preparation of about 10 //g protein/mL is 
distributed into wells of a 96-well ELISA plate. The plate is incubated for about 
2 hours at 37°C, then overnight at 4°C. The plate is washed with phosphate buffer 
saline (PBS) containing 0.05% Tween 20 (PBS/Tween buffer) and the wells are 
saturated with 250 lA PBS containing 1% bovine serum albumin (BSA), to prevent 
non-specific antibody binding. After 1 hour of incubation at 37°C, the plate is 
washed with PBS/Tween buffer. The antiserum is serially diluted in PBS/Tween 
buffer containing 0.5% BSA, and 100 fA dilutions are added to each well. The 
plate is incubated for 90 minutes at 37°C, washed, and evaluated usmg standard 
methods. For example, a goat anti-rabbit peroxidase conjugate can be added to the 
wells when the specific antibodies used were raised in rabbits. Incubation is 
carried out for about 90 minutes at 37 °C and the plate is washed. The reaction is 
developed with the appropriate substrate and the reaction is measured by 
colorimetry (absorbance measured spectrophotometrically). Under these 
experimental conditions, a positive reaction is shown once an O.D. value of 1.0 is 
detected with a dilution of at least about 1:50, preferably of at least about 1:500. 

In a dot blot assay, a purified product is preferred, although a whole cell 
extract can be used. Briefly, a solution of the product at a concentration of about 
100 //g/mL is serially diluted two-fold with 50 mM Tris-HCl (pH 7.5). One 
hundred fA of each dilution is applied to a filter, such as a 0.45 /urn nitrocellulose 
membrane, set in a 96-well dot blot apparatus (Biorad). The buffer is removed by 



applying vacuum to the system. Wells are washed by addition of 50 mM Tris-HCl 
(pH 7.5) and the membrane is air-dried. The membrane is saturated in blocking 
buffer (50 mM Tris-HCl (pH 7.5), 0.15 M NaCl, 10 g/L skim milk) and incubated 
with an antiserum diluted from about 1:50 to about 1:5000, preferably about 1:500. 
The reaction is detected using standard methods. For example, a goat anti-rabbit 
peroxidase conjugate can be added to tiie wells when rabbit antibodies are used. 
Incubation is carried out for about 90 minutes at 2,TQ and the blot is washed. The 
reaction is developed with the appropriate substrate and stopped. The reaction is 
then measured visually by the appearance of a colored spot, e.g., by colorimetry. 
Under these experimental conditions, a positive reaction is associated with 
detection of a colored spot for reactions carried out with a dilution of at least about 
1 :50, preferably, of at least about 1 :500. Therapeutic or prophylactic efficacy of a 
polypeptide or polypeptide derivative of the invention can be evaluated as 
described below. 

According to a seventh aspect of the invention, there is provided (i) a 
composition of matter containing a polypeptide of the invention together with a 
diluent or carrier; (ii) a pharmaceutical composition containing a therapeutically or 
prophylactically effective amount of a polypeptide of the invention; (iii) a method 
for inducing an immune response against Helicobacter in a mammal by 
administering to the mammal an immunogenically effective amount of a 
polypeptide of the invention to elicit an immune response, e.g., a protective 
immune response to Helicobacter, and (iv) a method for preventing and/or treating 
a Helicobacter {e.g., H. pylori, H.felis. H. mustelae, oxH. heilmanii) infection, by 
administering a prophylactic or therapeutic amount of a polypeptide of the 
invention to an individual in need of such treatment. Additionally, this aspect of 
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the invention includes the use of a polypeptide of the invention in the preparation 
of a medicament for preventing and/or treating Helicobacter infection. 

The immunogenic compositions of the invention can be administered by 
any conventional route in use in the vaccine field, for example, to a mucosal {e.g., 
5 ocular, intranasal, pulmonary, oral, gastric, intestinal, rectal, vaginal, or urinary 
tract) surface or via a parenteral (e.g., subcutaneous, intradermal, intramuscular, 
intravenous, or intraperitoneal) route. The choice of the administration route 
depends upon a number of parameters, such as the adjuvant used. For example, if 
a mucosal adjuvant is used, the intranasal or oral route will be preferred, and if a 

1 0 lipid formulation or an aluminum compound is used, a parenteral route will be 
preferred. In the latter case, the subcutaneous or intramuscular route is most 
preferred. The choice of administration route can also depend upon the nature of 
the vaccine agent. For example, a polypeptide of the invention fused to CTB or to 
LTB will be best administered to a mucosal surface. 

15 A composition of the invention can contain one or several polypeptides 

or derivatives of the invention. It can also contain at least one additional 
Helicobacter antigen, such as the urease apoenzyme, or a subunit, fi-agment, 
homolog, mutant, or derivative thereof 

For use in a composition of the invention, a polypeptide or polypeptide 

2 0 derivative can be formulated into or with liposomes, such as neutral or anionic 
liposomes, microspheres, ISCOMS, or virus-like particles (VLPs), to facilitate 
delivery and/or enhance the immune response. These compounds are readily 
available to those skilled in the art; for example, see Liposomes: A Practical 
Approach {supra). Adjuvants other than liposomes can also be used in the 
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invention and are well known in the art (see, for example, the list provided 
below). 

Administration can be achieved in a single dose or repeated as necessary 
at intervals that can be determined by one skilled in the art. For example, a 
priming dose can be followed by three booster doses at weekly or monthly 
intervals. An appropriate dose depends on various parameters, including the 
nature of the recipient (e.g., whether the recipient is an aduh or an infant), the 
particular vaccine antigen, the route and frequency of administration, the 
presence/absence or type of adjuvant, and the desired effect (e.g., protection and/or 
treatment), and can be readily determined by one skilled in the art. In general, a 
vaccine antigen of the invention can be administered mucosally in an amount 
ranging from about 10 //g to about 500 mg, preferably from about 1 mg to about 
200 mg. For a parenteral route of administration, the dose usually should not 
exceed about 1 mg, and is, preferably, about 100 /j,g. 

When used as components of a vaccine, the polynucleotides and 
polypeptides of the invention can be used sequentially as part of a multi-step 
immunization process. For example, a mammal can be initially primed with a 
vaccine vector of the invention, such as a pox virus, e.g., via a parenteral route, 
and then boosted twice with a polypeptide encoded by the vaccine vector, e.g., via 
the mucosal route. In another example, liposomes associated with a polypeptide or 
polypeptide derivative of the invention can be used for priming, with boosting 
being carried out mucosally using a soluble polypeptide or polypeptide derivative 
of the invention, in combination with a mucosal adjuvant (e.g., LT). 

Polypeptides and polypeptide derivatives of the invention can also be 
used as diagnostic reagents for detecting the presence of an'd-Helicobacter 
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antibodies, e.g., in blood samples. Such polypeptides can be about 5 to about 80, 
preferably, about 10 to about 50 amino acids in length and can be labeled or 
unlabeled, depending upon the diagnostic method. Diagnostic methods involving 
such a reagent are described below. 

Upon expression of a polynucleotide molecule of the invention, a 
polypeptide or polypeptide derivative is produced and can be purified using known 
methods. For example, the polypeptide or polypeptide derivative can be produced 
as a fusion protein containing a fused tail that facilitates purification. The fusion 
product can be used to immunize a small mzunmal, e.g., a mouse or a rabbit, in 
order to raise monospecific antibodies against the polypeptide or polypeptide 
derivative. The eighth aspect of the invention thus provides a monospecific 
antibody that binds to a polypeptide or polypeptide derivative of the invention. 

By "monospecific antibody" is meant an antibody that is capable of 
reacting witii a unique, naturally-occurring Helicobacter polypeptide. An antibody 
of the invention can be polyclonal or monoclonal. Monospecific antibodies can be 
recombinant, e.g., chimeric (e.g., consisting of a variable region of murine origin 
and a himian constant region), humanized (e.g., a human immunoglobulin constant 
region and a variable region of animal, e.g., murine, origin), and/or single chain. 
Botii polyclonal and monospecific antibodies can also be in the form of 
immunoglobulin fi-agments, e.g., F(ab)'2 or Fab firagments. The antibodies of the 
invention can be of any isotype, e.g., IgG or IgA, and polyclonal antibodies can be 
of a single isotype or can contain a mixture of isotypes. 

The antibodies of the invention, which can be raised to a polypeptide or 
polypeptide derivative of the invention, can be produced and identified using 
standard immunological assays, e.g., Westem blot assays, dot blot assays, or 




ELISA (see, e.g., Coligan et al.. Current Protocols in Immunology, John Wiley & 
Sons, Inc., New York, NY, 1994). The antibodies can be used in diagnostic 
methods to detect the presence of Helicobacter antigens in a sample, such as a 
biological sample. The antibodies can also be used in affinity chromatography 
5 methods for purifying a polypeptide or polypeptide derivative of the invention. As 
is discussed further below, the antibodies can also be used in prophylactic and 
therapeutic passive immunization methods. 

Accordingly, a ninth aspect of the invention provides (i) a reagent for 
detecting the presence of Helicobacter in a biological sample that contains an 

1 0 antibody, polypeptide, or polypeptide derivative of the invention; and (ii) a 
diagnostic method for detecting the presence of Helicobacter in a biological 
sample, by contacting the biological sample with an antibody, a polypeptide, or a 
polypeptide derivative of the invention, so that an immune complex is formed, and 
detecting the complex as an indication of the presence of Helicobacter in the 

15 sample or the organism from which the sample was derived. The immune 
complex is formed between a component of the sample and the antibody, 
polypeptide, or polypeptide derivative, and that any unboimd material can be 
removed prior to detecting the complex. A polypeptide reagent can be used for 
detecting the presence of anti-Helicobacter antibodies in a sample, e.g., a blood 

2 0 sample, while an antibody of the invention can be used for screening a sample, 
such as a gastric extract or biopsy sample, for tihe presence of Helicobacter 
polypeptides. 

For use in diagnostic methods, the reagent (e.g., the antibody, 
polypeptide, or polypeptide derivative of the invention) can be in a free state or 
2 5 can be immobilized on a soUd support, such as, for example, on the interior surface 



of a tube or on the surface, or within pores, of a bead. Immobilization can be 
achieved using direct or indirect means. Direct means include passive adsorption 
{Le., non-covalent binding) or covalent binding between the support and the 
reagent. By "indirect means" is meant that an anti-reagent compound that interacts 
with the reagent is first attached to the solid support. For example, if a polypeptide 
reagent is used, an antibody that binds to it can serve as an anti-reagent, provided 
that it binds to an epitope that is not involved in recognition of antibodies in 
biological samples. Indirect means can also employ a ligand-receptor system, for 
example, a molecule, such as a vitamin, can be grafted onto the polypeptide 
reagent and the corresponding receptor can be immobilized on the solid phase. 
This concept is illustrated by the well known biotin-streptavidin system. 
Alternatively, indirect means can be used, e.g., by adding to the reagent a peptide 
tail, chemically or by genetic engineering, and immobilizing the grafted or fiised 
product by passive adsorption or covalent linkage of the peptide tail. 

According to a tenth aspect of the invention, there is provided a process 
for purifying, from a biological sample, a polypeptide or polypeptide derivative of 
the invention, which involves carrying out antibody-based affinity chromatography 
with the biological sample, wherein the antibody is a monospecific antibody of the 
invention. 

For use in a purification process of the invention, the antibody can be 
polyclonal or monospecific, and preferably is of the IgG type. Purified IgGs can 
be prepared from an antiserum using standard methods (see, e.g., Coligan et al., 
supra). Conventional chromatography supports, as well as standard methods for 
grafting antibodies, are described, for example, by Harlow et al. (Antibodies: A 



Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
New York, 1988). 

Briefly, a biological sample, such as an K pylori extract, preferably in a 
buffer solution, is applied to a chromatography material, which is, preferably, 
equilibrated with the buffer used to dilute the biological sample, so that the 
polypeptide or polypeptide derivative of the invention (z.e., the antigen) is allowed 
to adsorb onto the material. The chromatography material, such as a gel or a resin 
coupled to an antibody of the invention, can be in batch form or in a column. The 
unbound components are washed off and the antigen is eluted with an appropriate 
elution buffer, such as a glycine buffer, a buffer containing a chaotropic agent, 
e.g., guanidine HCl, or a buffer having high salt concentration (e.g., 3 M MgCl2). 
Eluted fractions are recovered and the presence of the antigen is detected, e.g., by 
measuring the absorbance at 280 nm. 

An antibody of the invention can be screened for therapeutic efficacy as 
follows. According to an eleventh aspect of the invention, there is provided (i) a 
composition of matter containing a monospecific antibody of the invention, 
together with a diluent or carrier; (ii) a pharmaceutical composition containing a 
therapeutically or prophylactically effective amount of a monospecific antibody of 
the invention, and (iii) a method for treating or preventing Helicobacter {e.g., 
K pylori, H.felis, K mustelae, orK heilmanii) infection, by administering a 
therapeutic or prophylactic amount of a monospecific antibody of the invention to 
an individual in need of such treatment. In addition, the eleventh aspect of the 
invention includes the use of a monospecific antibody of the invention in the 
preparation of a medicament for treating or preventing Helicobacter infection. 
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The monospecific antibody can be polyclonal or monoclonal, and is, 
preferably, predominantly of the IgA isotype. In passive immunization methods, 
the antibody is administered to a mucosal surface of a mammal, e.g., the gastric 
mucosa, e.g., orally or intragastrically, optionally, in the presence of a bicarbonate 
buffer. Alternatively, systemic administration, not requiring a bicarbonate buffer, 
can be carried out. A monospecific antibody of the invention can be administered 
as a single active agent or as a mixture with at least one additional monospecific 
antibody specific for a different Helicobacter polypeptide. The amount of 
antibody and the particular regimen used can be readily determined by one skilled 
in the art. For example, daily administration of about 100 to 1,000 mg of antibody 
over one week, or three doses per day of about 100 to 1,000 mg of antibody over 
two or three days, can be effective regimens for most purposes. 

Therapeutic or prophylactic efficacy can be evaluated using standard 
methods in the art, e.g., by measuring induction of a mucosal immune response or 
induction of protective and/or therapeutic immunity, using, e.g., the Kfelis mouse 
model and the procedures described by Lee et al. (Eur. J. Gastroenterology & 
Hepatology 7:303, 1995) or Lee etal. (J. Infect. Dis. 172:161, 1995). Those 
skilled in the art will recognize that the Kfelis strain of the model can be replaced 
with another Helicobacter strain. For example, the efficacy of polynucleotide 
molecules and polypeptides from H. pylori is, preferably, evaluated in a mouse 
model using an H pylori strain. Protection can be determined by comparing the 
degree of Helicobacter infection in the gastric tissue assessed by, for example, 
urease activity, bacterial counts, or gastritis, to that of a control group. Protection 
is shown when infection is reduced by comparison to the control group. Such an 



evaluation can be made for polynucleotides, vaccine vectors, polypeptides, and 
polypeptide derivatives, as well as for antibodies of the invention. 

For example, various doses of an antibody of the invention can be 
administered to the gastric mucosa of mice previously challenged v^ith an K pylori 
strain, as described, e.g., by Lee et al {supra). Then, after an appropriate period of 
time, the bacterial load of the mucosa can be estimated by assessing urease 
activity, as compared to a control. Reduced urease activity indicates that the 
antibody is therapeutically effective. 

Adjuvants that can be used in any of the vaccine compositions described 
above are described as follows. Adjuvants for parenteral administration include, 
for example, aluminum compounds, such as aluminum hydroxide, aluminum 
phosphate, and aluminum hydroxy phosphate. The antigen can be precipitated 
with, or adsorbed onto, the aluminum compound using standard methods. Other 
adjuvants, such as RIBI (ImmunoChem, Hamilton, MT), can also be used in 
parenteral administration. 

Adjuvants that can be used for mucosal administration include, for 
example, bacterial toxins, e.g., the cholera toxin (CT), the E. coli heat-labile toxin 
(LT), the Clostridium difficile toxin A, the pertussis toxin (PT), and combinations, 
subunits, toxoids, or mutants thereof. For example, a purified preparation of native 
cholera toxin subunit B (CTB) can be used. Fragments, homologs, derivatives, 
and fusions to any of these toxins can also be used, provided that they retain 
adjuvant activity. Preferably, a mutant having reduced toxicity is used. Suitable 
mutants are described, e.g., in WO 95/1721 1 (Arg-7-Lys CT mutant), WO 96/6627 
(Arg-192-Gly LT mutant), and WO 95/34323 (Arg-9-Lys and Glu-129-Gly PT 
mutant). Additional LT mutants that can be used in the methods and compositions 




of the invention include, e.g., Ser-63-Lys, Ala-69-Gly, Glu-1 10-Asp, and Glu-1 12- 
Asp mutants. Other adjuvants, such as the bacterial monophosphoryl lipid A 
(MPLA) of, e.g., E. coli, Salmonella minnesota, Salmonella typhimurium, or 
Shigella flexneri; saponins, and polylactide glycolide (PLGA) microspheres, can 
5 also be used in mucosal administration. Adjuvants useful for both mucosal and 
parenteral administrations, such as polyphosphazene (WO 95/2415), can also be 
used. 

Any pharmaceutical composition of the invention, containing a 
polynucleotide, polypeptide, polypeptide derivative, or antibody of the invention, 

1 0 can be manufactured using standard methods. It can be formulated with a 

pharmaceutically acceptable diluent or carrier, e.g., water or a saline solution, such 
as phosphate buffer saline, optionally, including a bicarbonate salt, such as sodium 
bicarbonate, e.g., 0.1 to 0.5 M. Bicarbonate can advantageously be added to 
compositions intended for oral or intragastric administration. In general, a diluent 

15 or carrier can be selected on the basis of the mode and route of administration, and 
standard pharmaceutical practice. Suitable pharmaceutical carriers and diluents, as 
well as pharmaceutical necessities for their use in pharmaceutical formulations, are 
described in Remington's Pharmaceutical Sciences, a standard reference text in 
this field and in the USP/NF. 

2 0 The invention also includes methods in which gastroduodenal 

infections, such as Helicobacter infection, are treated by oral administration of a 
Helicobacter polyiptiptidQ of the invention and a mucosal adjuvant, in combination 
with an antibiotic, an antisecretory agent, a bismuth salt, an antacid, sucralfate, or a 
combination thereof. Examples of such compounds that can be administered with 

25 the vaccine antigen and an adjuvant are antibiotics, including, e.g., macrolides. 



tetracyclines, p-lactams, aminoglycosides, quinolones, penicillins, and derivatives 
thereof (specific examples of antibiotics that can be used in the invention include, 
e.g., amoxicillin, clarithromycin, tetracycline, metronidizole, erythromycin, 
cefuroxime, and erythromycin); antisecretory agents, including, e.g., H2-receptor 
5 antagonists (e.g., cimetidine, ranitidine, famotidine, nizatidine, and roxatidine), 
proton pump inhibitors (e.g., omeprazole, lansoprazole, and pantoprazole), 
prostaglandin analogs (e.g., misoprostil and enprostil), and anticholinergic agents 
(e.g., pirenzepine, telenzepine, carbenoxolone, and proglumide); and bismuth salts, 
including colloidal bismuth subcitrate, tripotassium dicitrate bismuthate, bismuth 

10 subsalicylate, bicitropeptide, and pepto-bismol (see, e.g., Goodwin et aL, 

Helicobacter pylori. Biology and Clinical Practice, CRC Press, Boca Raton, FL, 
pp 366-395, 1993; Physicians' Desk Reference, 49* edn., Medical Economics Data 
Production Company, Montvale, New Jersey, 1995). In addition, compounds 
containing more than one of the above-listed components coupled together, e.g., 

1 5 ranitidine coupled to bismuth subcitrate, can be used. The invention also includes 
compositions for carrying out these methods, i.e., compositions containing a 
Helicobacter antigen (or antigens) of the invention, an adjuvant, and one or more 
of the above-listed compounds, in a pharmaceutically acceptable carrier or diluent. 
Amounts of the above-listed compounds used in the methods and 

2 0 compositions of the invention can readily be determined by one skilled in the art. 
In addition, one skilled in the art can readily design treatment/immunization 
schedules. For example, the non-vaccine components can be administered on days 
1-14, and the vaccine antigen + adjuvant can be administered on days 7, 14, 21, 
and 28. 
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Methods and pharmaceutical compositions of the invention can be used 
to treat or to prevent Helicobacter infections and, accordingly, gastroduodenal 
diseases associated with these infections, including acute, chronic, and atrophic 
gastritis, and peptic ulcer diseases, e.g., gastric and duodenal ulcers. 
5 GHPO 1360 and GHPO 750 polypeptides were purified from 

Helicobacter pylori strain ATCC number 43579 (American Type Culture 
Collection, Rockville, Maryland) by inmiunoaffmity-based chromatography using 
the methods described below in Example 1, and the purified proteins were shown 
to be effective vaccine antigens as follows. 
,10 Groups of 10 mice each were orally immunized with 1, 5, or 25 ixg of 

\i either purified GHPO 1360 or GHPO 750, in combination with 5 /ig of the heat- 
W labile enterotoxin (LT) ofE. coli. Twenty five fxg of recombinant urease, in 
U combination with 5 jbtg LT, was used as a positive control, and 5 jbtg of LT in PBS 

was used as a negative control. The immunizations were carried out four times 
=^ 15 each, on days 0, 7, 14, and 21 of the experiment. On day 33, blood samples were 
collected from the mice and, on day 34, saliva samples were collected. On day 35, 
all of the mice were challenged by intragastric administration of 1 x 10^ 
^ - streptomycin-resistant, mouse-adapted K pylori. On day 49, additional saliva 
samples were collected and, about two weeks after challenge, on days 52-53, the 
2 0 mice were sacrificed. Stomachs were removed from the mice and were analyzed 
for Helicobacter infection by measuring urease activity in the intact stomach tissue 
and by a quantitative culture study (Table 1). 

Briefly, these studies showed that the gastric urease activities in samples 
from mice immunized with all three amounts of GHPO 1360 or GHPO 750 {i.e., 1, 
25 5, and 25 /^g), in combination with LT, were generally lower than the gastric 
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urease activities of samples from mice immunized with LT alone or mice that were 
not treated prior to challenge. Levels of gastric urease activity generally decreased 
with increasing amounts of GHPO 1360 and GHPO 750 administered, with the 
gastric urease activity levels for the 25 /ug doses of GHPO 1360 and GHPO 750 
5 generally approaching those of mice immunized with 25 /ig of recombinant urease 
and LT. 

The quantitative culture analyses showed that the levels of Helicobacter 
detected in the stomachs of mice immunized with GHPO 1360 and GHPO 750, 
which generally decreased with increasing dosages, were less than the levels 
,10 detected in the stomachs of control mice that were immunized with LT alone or 
O untreated before Helicobacter challenge (Table 1). The percentages of mice 

protected by immunization with GHPO 1 360 and GHPO 750 met or approached 
1^ the percentages of mice protected by treatment with urease (Table 1). These 
y results show that GHPO 1360 and GHPO 750 are effective vaccine antigens for 
Q 1 5 use in preventing Helicobacter infection. 
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The invention is further illustrated by the following examples. Example 
1 describes purification of GHPO 1360 and GHPO 750 &om Helicobacter 
cultures. Example 2 describes identification of genes, such as genes that encode 
GHPO 1360 and GHPO 750, in the Helicobacter genome, as well as identification 
5 of leader sequences, and primer design for amplification of genes lacking signal 
sequences. Example 3 describes cloning of DNA encoding GHPO 1360 and 
GHPO 750 into a vector that provides a histidine tag, and production and 
purification of the resulting his-tagged fusion proteins. Example 4 describes 
methods for cloning DNA encoding GHPO 1360 and GHPO 750 so that they can 
10 be produced without his-tags, and Example 5 describes methods for purifying 
recombinant GHPO 1360 and GHPO 750. 

EXAMPLE 1: Purification and partial sequence analysis of GHPO 1360 and 
GHPO 750 from Helicobacter pylori 

l.A« Culture and initial purification steps 

15 Frozen seeds from H. pylori strain ATCC 43579 are used to seed a 

75 cm^ flask containing a biphasic medium (a solid phase made of Colombia 
gelose containing 6% fresh sheep blood and a liquid phase made of triptcase soja 
containing 20% fetal calf serum). After 24 hours of culture under microaerophilic 
conditions, the liquid phase is used for seeding several 75 cm^ flasks containing 

2 0 biphasic mediirai that lacks sheep blood. After 24 hours of culture, the liquid 
phase is used to seed a 2 L biofermentor in triptcase soja liquid phase containing 
10 g/L beta-cyclodextrine. At OD 1.5-1 .8, this culture is diluted into a 10 L 
biofermentor containing the liquid medium. After 24 hours, the bacteria are 
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centrifuged at 4,000 x g for 30 minutes at 4''C. A 10 L culture contains about 20 
to 30 g (wet weight) bacteria. 

The pellet obtained as described above is washed with 500 mL PBS 
(Phosphate Buffered Saline: 7.650 g NaCl, 0.724 g disodium phosphate, and 0.210 
5 g monopotassium phosphate in one liter (pH 7.2)) for a 1 L culture. The bacteria 
are then centrifuged again under the same conditions. 

The pellet (CI) is suspended in 1% N-octyl-D-glucopyranoside (OG; 
30 mL/L; Sigma). The bacterial suspension is incubated for 1 hour at room 
temperature under stirring, is centrifuged at 17,600 x g for 30 minutes at 4*^C, and 
^10 the pellet (C2) is recovered. 

O The supernatant (S2) is dialyzed against PBS ovemight at 4''C under stirring. 

Ill 

The precipitate is recovered by centrifiigation at 2,600 x g for 30 minutes at 4°C. 
S The supematant (S2d) is discarded and the pellet (Cs2d) is recovered and stored at 

y -20°c. 

U 15 The pellet (C2) is resuspended in 20 mM Tris-HCl buffer (pH 7.5) and 100 

laM Pefabloc (Buffer A), and is homogenized with an ultra-turrax (382 1 , Janke and 
^ Kungel). Lysozyme and EDTA are added at 0.1 mg/mL and 1 mM, respectively. 
W The homogenate is sonicated three times for 2 minutes each at 4°C, and then 

is spun in an ultracentrifiige at 210,000 x g for 30 minutes at 4''C. The supematant 
2 0 (S3), which contains the cytoplasmic and periplasmic proteins, is eliminated, and 
the pellet is recovered, washed with buffer A, and spun in an ultracentrifiige at 
210,000 X g for 30 minutes at 4°C. The supematant (S4) is eliminated and the 
pellet (C4) is stored at -20°C. This pellet (C4) contains membrane proteins. 
The C4 pellet is washed in 50 mM NaC03 (pH 9.5) and 100 ^M Pefabloc 
2 5 (buffer B). The suspension is spun in an ultracentrifiige at 210,000 x g for 
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30 minutes at 4°C. The supernatant (S5) is eliminated and then the pellet (C5) is 
washed and spun in an ultracentrifuge as is described above. The supematant (S6) 
is eliminated and the pellet (C6) is stored at -20°C. 

I.B. Purification of the proteins of membrane fraction C4 by preparative 
5 SDS-PAGE 

SDS-PAGE is carried out according to the method of Laemmli {supra) using 
a biphasic gel consisting of a 5% polyacrylamide concentrating gel and a 10% 
polyacrylamide separating gel. The membrane fraction C4 is resuspended in 
buffer A, diluted in an equal volume of 2x sample buffer, and heated for 5 minutes 
- 3 10 at 95^C. About 19 mg of protein is applied to the gel (16 x 12 cm; 5 mm 

thickness). Pre-migration is carried out for 2 hours at 50 V, and is followed by an 
^ overnight migration at 65 V. After Coomassie blue staining, five major bands are 
y revealed that have apparent molecular weights of 87, 76, 54, 50, and 32 kDa. The 
Q bands at 50 and 32 kDa appear to be shghtly contaminated with bands at 47 and 35 
15 kDa, respectively. 

Bands corresponding to the 50 and 32 kDa proteins are cut out of the gel and 
?U are pounded with an ultra-turrax in 10-20 mL extraction buffer (25 mM Tris-HCl 
(pH 8.8), 8 M urea, 10% SDS, 100 \M phenyl methyl sulfonyl fluoride (PMSF), 
and 10 Pefabloc (buffer C)). 
2 0 Each homogenate is filtered through a Millipore AP20 filter under 7 bars at 

room temperature, washed with 5-10 mL buffer C, and then filtered again. Each 
filtrate is precipitated with three volumes of a 50/50 mixture of 75% methanol and 
75% isopropanol, and then is centrifuged at 240,000 x g for 16 hours at 10°C. 
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Each pellet is resuspended in 2 mL 10 mM NaPO, (pH 7.0), containing 1 M 
NaCl, 0.1 % Sarkosyl, 100 \M PMSF, and 6 M urea (buffer D). The solubilized 
sample is dialyzed, in order, against 100 mL buffer D containing 4 M urea, 
100 mL buffer D containing 2 M urea and 0.5 % Sarkosyl, and twice against 
5 100 mL buffer D that is urea and Sarkosyl free. The dialyses are each carried out 
for 1 hour imder stirring at room temperature. The last dialysate is incubated for 
30 minutes in an ice bath, and then is centrifuged at low speed for 10 minutes at 
4°C. The supematant is recovered, filtered through a Millipore filter (0.45 \im), 
and stored at -20°C. 

10 I.e. Purification of the 50 or 32 kDa protein by immunoaffinity-based 
chromatography 

l.C.l. Antiserum preparation 

Specific polyclonal serum against the 50 or 32 kDa protein that is purified 
by prq)arative SDS-PAGE is prepared by hyperimmunizing rabbits as follows. 
15 On day 0, a preparation containing 50 ng of the protein mixed with complete 

Freund's adjuvant is subcutaneously administered to rabbits at multiple sites. The 
rabbits are boosted at days 21 and 42 with 25 |j,g of the protein in incomplete 
Freund's adjuvant, and are sacrificed at day 60. Complement is removed from the 
rabbit serum by heating for 30 minutes at 56°C. The hyperimmune serum is then 
2 0 sterilized by filtration through a Millipore membrane (0.22 \im). 
l.C.2. IgG purification 

The hyperimmune serum prepared as described above is applied to a 
Protein A Sepharose Fast Flow column (Pharmacia), which is equilibrated with 
100 mM Tris-HCl (pH 8.0). The column is washed witii 10 column volumes of 
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100 mM Tris-HCl (pH 8.0), and then with 10 column volumes of 10 mM Tris-HCl 
(pH 8.0). IgGs are eluted with 0.1 M glycine buffer (pH 3.0), and are collected in 
5 mL fractions, to each of which 0,25 mL Tris-HCl (pH 8.0) is added. The optical 
density of each fraction is measured at 280 nm, the IgG-containing fractions are 
5 pooled together, and, if necessary, frozen at -70°C. 
l.C.3. Preparation of the column 

An appropriate amount of CNBr-activated Sepharose 4B gel (Pharmacia; 
reference: 17-0430-01) is suspended in 1 mM NaCl buffer (1 g dry gel provides for 
3.5 mL hydrated gel; 5 to 10 mg IgGs can be retained per mL of hydrated gel). 
,10 The gel is then washed with a buchner by adding small quantities of 1 mM HCl. 
^ The total volume of 1 mM HCl that is used amounts to 200 mL/g of gel. 
L Purified IgGs are dialyzed for 4 hours at room temperature against 

. - 50 volumes of 500 mM sodium phosphate buffer (pH 7.5), and then are diluted to 
: :J 3 mg/mL with the same buffer. IgGs are then incubated with the gel ovemight at 

15 5^3"^ C under stirring. The gel is packed in a chromatography column and is 
n washed with 2 column volumes of 500 mM phosphate buffer (pH 7.5). The gel is 
then transferred to a tube, incubated with 100 mM ethanolamine (pH 7.5), and then 
washed with 2 column volumes of PBS. The gel can be stored in PBS/merthiolate, 
1/10,000. 

20 l.C.4. Adsorption and elution 

The 50 or 32 kDa protein is purified by immunoaffinity-based 
chromatography as follows. In order to separate the 50 or 32 kDa protein fi:om the 
contaminating proteins (the 47 and 35 kDa proteins, respectively), membrane 
fraction C4 is solubilized in 50 mM NaCOa (pH 9.5) for 30 minutes at room 
2 5 temperature under stirring and the preparation is centrifuged for 30 minutes at 
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200,000 X g at The 47 and 35 kDa proteins are insoluble in the NaCOj buffer 

and are eliminated in the pellet 

The supernatant is dialyzed against 50 mM Tris-HCL (pH 8.0), 2 mM 

EDTA, and then is filtered through a 0.45 |im membrane. The filtered supernatant 
5 is applied to the column, which is equilibrated with 50 mM Tris-HCL (pH 8.0), 

2 mM EDTA, at a flow rate of about 10 mL/hoxir. The column is washed with 

20 column volumes of 50 mM Tris-HCL (pH 8.0), 2 mM EDTA, and then with 2 

to 6 volumes of 10 mM phosphate buffer (pH 6.8). 

The antigen is eluted with 100 mM glycine buffer (pH 2.5). The eluate is 
10 collected in 3 mL fractions, to each of which is added 150 |liL 1 M phosphate 

buffer (pH 8.0). The optical density of each fraction is measured at 280 nm, and 

fractions containing the 50 or 32 kDa protein are pooled and stored at -70^C. 

Analysis of the purified protein by 10% SDS-PAGE reveals a single band at 

50 or 32 kDa. N-terminal sequencing is carried out with the purified 50 kDa 
15 protein preparation. The sequence found is as follows: 

MKEKFNRTKPHVNIGTIGHVDH (SEQ ID N0:9). 

Similarly, N-terminal and internal sequencing is carried out with the purified 

32 kDa preparation. The sequences found are as follows: AHNANNATHNTKK 

(SEQ ID NO:10) and KPAHNA (SEQ ID N0:11) (N-terminal), and 
2 0 IDKQPKAKK (SEQ ID NO: 12) and FWAKKQAE (SEQ ID NO: 13) (intemal). 

I.D. Purification of the 32 kDa protein from membrane fraction C4 

The 32 kDa protein can also be purified as follows. Membrane firaction C4 
is solubilized in 50 mM NaCOj buffer (pH 9.5) at room temperature for 30 minutes 
under stirring. The suspension is then centrifuged at 200,000 x g for 30 minutes at 
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4°C. This allows the 32 and 35 kDa proteins to be separated, since the 35 kDa 
protein is insoluble in the NaCOs buffer. The supernatant is dialyzed against 50 
mM NaPO^ buffer (pH 7.0), and then is applied to an SP-Sepharose column, 
which is equilibrated with the NaPO^ buffer. The column is washed with the 
5 NaPO^ buffer, and then an 0-0.5 M NaCl gradient is applied to the column. The 
fraction eluted between 0.26 and 0.31 M contains the 32 kDa protein. 

l.£. Purification of the 50 kDa protein from membrane fraction C4 

The 50 kDa protein can also be purified as follows. Membrane fi-action C4 

is solubilized in 50 mM NaCOj buffer (pH 9.5) at room temperature for 30 minutes 
1 0 under stirring. The suspension is then centrifuged at 200,000 x g for 30 minutes at 

4°C. This allows the 50 and 47 kDa proteins to be separated, since the 47 kDa 

protein is insoluble in the NaCOs buffer. The supematant is dialyzed against 50 

mM NaPO, buffer (pH 7.0). 

A 40 mL Q-Sepharose colunm (diameter: 2.5 cm; height: 8 cm) is prepared 
1 5 according to the manufacturer's instructions (Pharmacia), washed, and equilibrated 

with buffer B (pH 9.5) (50 mM NaCOj, 100 ^iM Pefabloc, and 0.1% Zwittergent 

3-14). 

The chromatography is monitored by UV detection at 280 nm at the colunm 
exit. One hundred and forty mg of protein solubilized as described above are 
2 0 applied to the column, which is then washed with buffer B until the absorbance at 
280 nm is stabilized. The proteins are eluted with a 0.1-0.5 M NaCl gradient in 
buffer B (10 fold V,.), which is followed by washing in buffer B containing 0.5, 
and then 1, M NaCl (2 fold V^). The fractions are recovered, analyzed by SDS- 
PAGE, and pooled according to their electrophoretic profiles. 
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Fraction 9, which corresponds to the beginning of the washing at 1 M NaCl 
and contains acidic proteins, is further purified as follows. A 10 mL DEAE 
Sepharose column (diameter: 1.5 cm, height: 5 cm) is prepared according to the 
manufacturer's instructions (Pharmacia) (up to 10 mg protein/mL of gel). The 
5 column is washed and equilibrated with buffer B. Chromatography is monitored 
as is described above. 

Fraction 9 is dialyzed against buffer B and contains about 10 mg protein. 
Fraction 9 is applied to the DEAE-Sepharose column. The column is washed with 
buffer B until the absorbance at 280 nm is stabilized. The proteins are eluted with 
10 a 0-0.5 M NaCl gradient in buffer B (10 fold V^), followed by washing in buffer B, 
containing 1 M NaCl (2 fold V^). Fractions are recovered and analyzed by SDS- 
PAGE. The 50 kDa protein is found in the fractions eluted at 0.3-0.4 M NaCl. 

EXAMPLE 2: Identification of genes, such as genes encoding GHPO 1360 
and GHPO 750, in the H. pylori genome, identification of leader sequences, 
15 and primer design for amplification of genes lacking signal sequences 

Creating H. pylori genomic databases 

The K pylori genome was provided as a text file containing a single 
contiguous string of nucleotides that had been determined to be 1 .76 Megabases in 
length. The complete genome was split into 17 separate files using the program 
2 0 SPLIT (Creativity in Action), giving rise to 16 contigs, each containing 100,000 
nucleotides, and a 17* contig containing the remaining 76,000 nucleotides. A 
header was added to each of the 17 files using the format: >hpg0.txt (representing 
contig 1), .hpgl.txt (representing contig 2), etc. The resulting 17 files, named 

- 53 - 



hpgO through hpgl6, were then copied together to form one file that represented 
the plus strand of the complete K pylori genome. The constructed database was 
given the designation "H." A negative strand database of the K pylori genome 
was created similarly by first creating a reverse complement of the positive strand 
using the program SeqPup (D.G. Gilbert, Indiana University Biology Department) 
and then performing the same procedure as described above for the plus strand 
This database was given the designation **N." 

The regions predicted to encode open reading firames (ORFs) were defined 
for the complete K pylori genome using the program GENEMARK™ 
(Borodovsky et al, Comp. Chem. 17:123, 1993). A database was created from a 
text file containing an annotated version of all ORFs predicted to be encoded by 
the H. pylori genome for both the plus and minus strands, and was given the 
designation "O." Each ORE was assigned a number indicating its location on the 
genome and its position relative to other genes. No manipulation of the text file 
was required. 

2.B. Searching the H. pylori databases 

The databases constructed as is described above were searched using the 
program PASTA (Pearson et al, Proc, Natl. Acad. Sci. USA 85:2444-2448, 1988). 
FASTA was used for searching either a DNA sequence against either of the gene 
databases ("H" and/or "N"), or a peptide sequence against the ORE library ("O"). 
TFASTX was used to search a peptide sequence against all possible reading 
frames of a DNA database ("H" and/or libraries). Potential frameshifls also 
being resolved, FASTX was used for searching the translated reading frames of a 



- 54 - 



DNA sequence against either a DNA database, or a peptide sequence against the 
protein database. 

2.C, Isolation of DNA sequences from the H. pylori genome 

The FASTA searches against the constructed DNA databases identified 
5 exact nucleotide coordinates on one or more of the isolated contigs, and therefore 
the location of the target DNA. Once the exact location of the target sequence was 
known, the contig identified to carry the gene was exported into the software 
package MapDraw (DNAStar, Inc.) and the gene was isolated. Gene sequences 
with flanking DNA was then excised and copied into the EditSeq. Software 
1 0 package (DNAStar, Inc.) for further analysis. 

2.D. Identification of leader sequences 

The deduced protein encoded by a target gene sequence is analyzed using 
the PROTEAN software package (DNAStar, Inc.). This analysis predicts those 
areas of the protein that are hydrophobic by using the Kyte-Doolittle algorithm, 

1 5 and identifies any potential polar residues preceding the hydrophobic core region, 
which is typical for many leader sequences. For confirmation, the target protein is 
then searched against a PROSITE database (DNAStar, Inc.) consisting of motifs 
and signatures. Characteristic of many leader sequences and hydrophobic regions 
in general, is the identification of predicted prokaryotic lipid attachment sites. 

2 0 Where confirmation between the two approaches is apparent at the N-terminus of 
any protein, putative cleavage sites are sought Specifically, this includes the 
presence of either an Alanine (A), Serine (S), or Glycine (G) residue immediately 
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after the core hydrophobic region. In the case of lipoproteins, a Cysteine (C) 
residue would be identified as the +1 residue, post-cleavage. 

Rational design of PCR primers based on the identification of leader 
sequences 

In order to clone gene sequences as N-terminus translational fusions for the 
generation of recombinant proteins with N-terminal Histidine tags, the gene 
sequence that specifies the leader sequence is omitted. The 5 -end of the gene- 
specific portion of the N-terminal primer is designed to start at the first codon 
beyond the cleavage site. In the case of lipoproteins, the 5 - end of the N-terminal 
primer begins at the second codon, immediately after the modifiable residue at 
position +1 post-cleavage. The omission of the leader sequence from the 
recombinant allows for one-step purification, and potential problems associated 
with insertion of leader sequences in the membrane of the host strain carrying the 
hybrid construct are avoided. 

EXAMPLE 3: Preparation of isolated DNA encoding GHPO 1360 and 
GHPO 750, and production of GHPO 1360 and GHPO 750 as histidine-tagged 
fusion proteins 

3.A. Preparation of genomic DNA from Helicobacter pylori 

Helicobacter pylori strain ORV2001, stored in LB medium containing 50% 
glycerol at -70"* C, is grown on Colombia agar containing 7% sheep blood for 48 
hours under microaerophilic conditions (8-10% CO2, 5-7% O2, 85-87% Nj). Cells 
are harvested, washed with phosphate buffer saline (PBS) (pH 7.2), and DNA is 
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then extracted from the cells using the Rapid Prep Genomic DNA Isolation kit 
(Pharmacia Biotech). 

3.B. PGR amplification 

DNA molecules encoding GHPO 1360 and GHPO 750 are amplified from 
genomic DNA, as can be prepared as is described above, by the Polymerase Chain 
Reaction (PGR) using the following primers: 
GHPO 1360: 

N-terminal primer 

5'-CGCGGATCCGAATGAAAAAAAATATCTTAAAT-3' (SEQ IDN0:5); and 

C-terminal primer: 
5'-CCGCTCGAGTTACTTGTTGATAACAATnT-3' (SEQ ID N0:6). 
GHPO 750: 

N-terminal primer: 

5'-CGCGGATCCGAATGGCAAAAGAAAAGTTTAAC-3' (SEQ ID N0:7); and 

C-terminal primer: 
5'-CCGCTCGAGTTATTCAATAATATTGCTCAC-3' (SEQ ID N0:8). 

The N-terminal and C-terminal primers for each clone both include a 5' 
clamp and a restriction enzyme recognition sequence for cloning purposes (BamUl 
(GGATCC) and^ol (CTCGAG) recognition sequences). In the case of GHPO 
1360, the N-terminal primer is designed so that the amplified product does not 
encode the leader sequence and the potential cleavage site. 

Amplification of gene-specific DNA is carried out using Thermalase DNA 
Polymerase (Amresco) according to the manufacturer's instructions. The reaction 
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mixture, which is brought to a final volume of 100 jiL with distilled water, is as 
follows: 



dNTPsmix 



200 



lOx ThermoPol buffer 



10 



pnmers 



300 nM each 



DNA template 



50 ng 



Thermalase DNA polymerase 



2imits 



Appropriate amplification reaction conditions can readily be determined by 
one skilled in the art. In the present case, the following conditions were used: a 
denaturing step was carried out at 95 °C for 30 seconds, followed by an annealing 
step at 55 ""C for one minute, and an extension step at 72 °C for 2 minutes. Thirty 
cycles were carried out. 

3.C. Transformation and selection of transformants 

A single PGR product is thus amplified and is then digested at 37'' C for 2 
hours with BamHl mdXhol concurrently in a 20 /uL reaction volume. The 
digested product is ligated to similarly cleaved pET28a (Novagen) that is 
dephosphorylated prior to the ligation by treatment with Calf Intestinal Alkaline 
Phosphatase (CIP). The gene fusion constmcted in this maimer allows one-step 
affinity purification of the resulting fiision protein because of the presence of 
histidine residues at the N-terminus of the fiision protein, which are encoded by the 
vector. 

The ligation reaction (20 /^L) is carried out at 14°C overnight and then is 
used to transform 100 fuL firesh E. coli XL 1 -blue competent cells (Novagen). The 
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cells are incubated on ice for 2 hours, then heat-shocked at 42''C for 30 seconds, 
and returned to ice for 90 seconds. The samples are then added to 1 mL LB broth 
in the absence of selection and grown at 37 °C for 2 hours. The cells are then 
plated out on LB agar containing kanamycin (50 /^g/mL) at a 1 Ox and neat dilution 
5 and incubated overnight at 37 ""C. The following day, 50 colonies are picked onto 
secondary plates and incubated at 37 ""C overnight 

Five colonies are picked into 3 mL LB broth supplemented with kanamycin 
(100 /^g/mL) and are grown overnight at 37''C. Plasmid DNA is extracted using 
the Quiagen mini-prep, method and is quantitated by agarose gel electrophoresis. 

1 0 PGR is performed with the gene-specific primers under the conditions stated 

above and transformant DNA is confirmed to contain the desired insert. If PCR- 
positive, one of the five plasmid DNA samples (500 ng) extracted jfrom the E. coli 
XLl-blue cells is used to transform competent BL21 (^DE3) E. coli competent 
cells (Novagen; as described previously). Transformants (10) are picked onto 

15 selective kanamycin (50 [ig/mL) containing LB agar plates and stored as a 
research stock in LB containing 50% glycerol. 

3.D. Purification of recombinant proteins 

One mL of frozen glycerol stock prepared as described in 3.C. is used to 
inoculate 50 mL of LB medium containing 25 ^ig/mL of kanamycin in a 250 mL 
2 0 Erlenmeyer flask. The flask is incubated at 37°C for 2 hours or until the 

absorbance at 600 nm (ODgoo) reaches 0.4-1.0. The culture is stopped from 
growing by placing the flask at 4°C ovemight. The following day, 10 mL of the 
ovemight culture are used to inoculate 240 mL LB medium containing kanamycin 
(25 |ig/mL), with the initial ODgQO about 0.02-0.04. Four flasks are inoculated for 




each ORF. The cells are grown to an ODgQO of 1-0 (about 2 hours at 37°C), a 1 
mL sample is harvested by centrifugation, and the sample is analyzed by SDS- 
PAGE to detect any leaky expression. The remaining culture is induced with 1 
mM IPTG and the induced cultures are grown for an additional 2 hours at 37°C. 
5 The final ODgoo taken and the cells are harvested by centrifugation at 

5,000 X g for 15 minutes at 4°C. The supernatant is discarded and the pellets are 
resuspended in 50 mM Tris-HCl (pH 8.0), 2 mM EDTA. Two hundred and fifty 
mL of buffer are used for 1 L of culture and the cells are recovered by 
centrifugation at 12,000 x g for 20 minutes. The supematant is discarded and the 
1 0 pellets are stored at -45°C. 

3.£. Protein purification 

Pellets obtained from 3.D. are thawed and resuspended in 95 mL of 50 mM 
Tris-HCl (pH 8.0). Pefabloc and lysozyme are added to final concentrations of 
100 |j.M and 100 ^g/mL, respectively. The mixture is homogenized with magnetic 
15 stirring at 5*^C for 30 minutes. Benzonase (Merck) is added at a 1 U/mL final 
concentration, in the presence of 10 mM MgCl2, to ensure total digestion of the 
DNA. The suspension is sonicated (Branson Sonifier 450) for 3 cycles of 2 
minutes each at maximum output. The homogenate is centrifiiged at 19,000 x g 
for 15 minutes and both the supematant and the pellet are analyzed by SDS-PAGE 
20 to detect the cellular location of the target protein in the soluble or insoluble 
fractions, as is described further below. 
3.E.I. Soluble fraction 
If the target protein is produced in a soluble form (Le,, in the supematant 
obtained in 3.E.) NaCl and imidazole are added to the supematant to final 
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concentrations of 50 mM Tris-HCl (pH 8.0), 0.5 M NaCl, and 10 mM imidazole 
(buffer A). The mixture is filtered through a 0.45 |Lim membrane and loaded onto 
an IMAC column (Pharmacia HiTrap chelating Sepharose; 1 mL), which has been 
charged with nickel ions according to the manufacturer's recommendations. After 
5 loading, the column is washed with 50 column volumes of buffer A and the 

recombinant target protein is eluted with 5 mL of buffer B (50 mM Tris-HCl (pH 
8.0), 0.5 M NaCl, 500 mM imidazole). 

The elution profile is monitored by measuring the absorbance of the 
fractions at 280 nm. Fractions corresponding to the protein peak are pooled, 
10 dialyzed against PBS containing 0.5 M arginine, filtered through a 0.22 )j.m 
3 membrane, and stored at -45°C. 

3.E.2. Insoluble fraction 
H If the target protein is expressed in the insoluble fraction (pellets obtained 

;g from 3.E.), purification is conducted under denaturing conditions. NaCl, 
==5 15 imidazole, and urea are added to the resuspended pellet to final concentrations of 
50 mM Tris-HCl (pH 8.0), 0.5 M NaCl, 10 mM imidazole, and 6 M urea (buffer 
:i C). After complete solubilization, the mixture is filtered through a 0.45 |im 
membrane and loaded onto an IMAC column. 

The purification procedures on the IMAC column are the same as described 
20 in 3 .E. 1 ,, except that 6 M urea is included in all buffers used and 1 0 column 

volumes of buffer C are used to wash the column after protein loading, instead of 
50 column volumes. 

The protein fractions eluted from the IMAC column with buffer D (buffer C 
containing 500 mM imidazole) are pooled. Arginine is added to the solution to 
25 final concentration of 0.5 M and the mixture is dialyzed against PBS containing 
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0.5 M arginine and various concentrations of urea (4 M, 3 M, 2 M, 1 M, and 0.5 
M) to progressively decrease the concentration of urea. The final dialysate is 
filtered through a 0.22 fim membrane and stored at -45°C. 

Alternatively, when the above purification process is not as efficient as it 
should be, two other processes may be used as follows. A first alternative involves 
the use of a mild denaturant, N-octyl glucoside (NOG). Briefly, a pellet obtained 
in 3.E. is homogenized in 5 mM imidazole, 500 mM sodium chloride, 20 mM Tris- 
HCl (pH 7.9) by microfluidization at a pressure of 15,000 psi and is clarified by 
centrifligation at 4,000-5,000 x g. The pellet is recovered, resuspended in 50 mM 
NaP04 (pH 7,5) containing 1-2 % weight /volume NOG, and homogenized. The 
NOG-soluble impurities are removed by centrifligation. The pellet is extracted 
once more by repeating the preceding extraction step. The pellet is dissolved in 8 
M urea, 50 mM Tris (pH 8.0). The urea-solubilized protein is diluted with an 
equal volume of 2 M arginine, 50 mM Tris (pH 8.0), and is dialyzed against 1 M 
arginine for 24-48 hours to remove the urea. The final dialysate is filtered through 
a 0.22 fim membrane and stored at -45°C. 

A second alternative involves the use of a strong denaturant, such as 
guanidine hydrochloride. Briefly, a pellet obtained in 3.E. is homogenized in 5 
mM imidazole, 500 mM sodium chloride, 20 mM Tris-HCl (pH 7,9) by 
microfluidization at a pressure of 15,000 psi and clarified by centrifligation at 
4,000-5,000 X g. The pellet is recovered, resuspended in 6 M guanidine 
hydrochloride, and passed through an IMAC colunm charged with Ni-H-. The 
bound antigen is eluted with 8 M urea (pH 8.5). Beta-mercaptoethanol is added to 
the eluted protein to a final concentration of 1 mM, then the eluted protein is 
passed through a Sephadex G-25 column equilibrated in 0.1 M acetic acid. Protein 
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eluted from the column is slowly added to 4 volumes of 50 mM phosphate buffer 
(pH 7.0), and it remains in solution. 

3.F. Evaluation of the protective activity of the purified protein 

A protection test is described above that was carried out for testing the 
5 protective activity of the purified, native proteins. This test can also be used for 
testing the protective efficacy of recombinant proteins. Altematively, the 
following test can be used. 

Groups of 10 OF 1 mice (IFFA Credo) are immunized rectally wiHi 25 /j.g of 
the purified recombinant protein, admixed with 1 jUg of cholera toxin (Bema) in 
10 physiological buffer. Mice are immunized on days 0, 7, 14, and 21. Fourteen days 
after the last immunization, the mice are challenged with H, pylori strain 
ORV2001 grown in liquid media (the cells are grown on agar plates, as described 
in I.A. and, after harvest, the cells are resuspended in Brucella broth; the flasks are 
then incubated overnight at 37 °C). Fourteen days after challenge, the mice are 
15 sacrificed and flieir stomachs are removed. The amount of K pylori is determined 
by measuring the urease activity in tiie stomach and by culture. 

3.G. Production of monospecific polyclonal antibodies 
3.G,1, Hyperimmune rabbit antiserum 

New Zealand rabbits are injected both subcutaneously and intramuscularly 
2 0 with 1 00 yug of a purified ftision polypeptide, as obtained in 3.E. 1 . or 3.E.2., in the 
presence of Freund's complete adjuvant and in a total volume of approximately 2 
mL. Twenty one and 42 days after the initial injection, booster doses, which are 
identical to priming doses, except that Freund's incomplete adjuvant is used, are 
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administered in the same way. Fifteen days after the last injection, animal serum is 
recovered, decomplemented, and filtered through a 0.45 /^m membrane. 
3.G.2. Mouse hyperimmune ascites fluid 

Ten mice are injected subcutaneously with 10-50 f^g of a purified ftision 
5 polypeptide as obtained in 3.E.I. or 3.E.2., in the presence of Freund's complete 
adjuvant and in a volume of approximately 200 fuL. Seven and 14 days after the 
initial injection, booster doses, which are identical to the priming doses, except 
that Freund's incomplete adjuvant is used, are administered in the same way. 
Twenty one and 28 days after the initial infection, mice receive 50 /xg of the 
1 0 antigen alone intraperitoneally. On day 2 1 , mice are also injected intraperitoneally 
□ with sarcoma 180/TG cells CM26684 (Lennette et aL, Diagnostic Procedures for 
Viral, Rickettsial, and Chlamydial Infections, 5th Ed. Washington DC, American 
Q Public Health Association, 1979). Ascites fluid is collected 10-13 days after the 
last injection. 

15 EXAMPLE 4: Methods for producing transcriptional fusions lacking His- 
y tags 

fU Methods for ampHfication and cloning of DNA encoding GHPO 1360 and 

GHPO 750 as transcriptional fiisions lacking His-tags are described as fi)llows. 
Two PGR primers for each clone are designed based upon the sequences of the 
2 0 polynucleotides that encode GHPO 1360 (SEQ ID NO: 1) and GHPO 750 (SEQ ID 
N0:3). These primers can be used to amplify DNA encoding GHPO 1360 and 
GHPO 750 from any Helicobacter pylori strain including, for example, ORV2001 
and the Helicobacter pylori strain deposited with the American Type Culture 
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Collection (ATCC, Rockville, Maryland) as ATCC number 43579, as well as from 
other Helicobacter species. 

The N-terminal primers are designed to include the ribosome binding site of 
the target gene, the ATG start site, and, in the case of GHPO 1360, the leader 
sequence (with cleavage site). The N-terminal primers include a 5' clamp and a 
BamHl recognition sequence (GGATCC), which facilitates subsequent cloning. 
The C-terminal primers include mXhol recognition sequence, which can be used 
in subsequent cloning, and a TAA stop codon. 
GHPO 1360: 

N-terminal primer: 

5*-GTGGAGAACACACAATGAAAAAAAATATC-3* (SEQ ID NO: 14); and 
C-teraiinal primer: 

5'-GCTAATATTATTCAATAATATTGCTCACAAC-3' (SEQ ID N0:15). 
GHPO 750: 

N-terminal primer: 
5'-GGAGAAATACAAATGGCAAAAGAAAAG-3' (SEQ ID N0:16); and 

C-terminal primer: 

5*-GCTAATATTATTCAATAATATTGCTCACAAC-3' (SEQ ID N0:17). 

Amplification of genes encoding GHPO 1360 and GHPO 750 is carried out 
using Thermalase DNA Polymerase under the conditions described above in 
Example 3. Alternatively, Vent DNA polymerase (New England Biolabs) or Pwo 
DNA polymerase (Boehringer Mannheim) can be used, according to instructions 
provided by the manufacturers. 

In the case where the 5* end of the 5' primer includes a BamHl site and the 
5' end of the 3' primer includes anXhol site (see above), a single PGR product for 
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each of GHPO 1360 and GHPO 750 is amplified and cloned into BamHl-Xhol 
cleaved pET 24, resulting in construction of a transcriptional fusion, which permits 
expression of the proteins without His-tags. The expressed products can be 
purified as denatured proteins that are refolded by dialysis into 1 M arginine. 
5 Cloning into pET 24 allows transcription of the genes from the T7 

promoter, which is supplied by the vector, but relies upon binding of the RNA- 
specific DNA polymerase to the intrinsic ribosome binding sites of GHPO 1360 
and GHPO 750, and thereby expression of the complete ORF. The amplification, 
digestion, and cloning protocols are as described above for constructing 
1 0 translational fusions. 

3 

ij EXAMPLE 5: Purification of GHPO 1360 and GHPO 750 by immunoaffinity 

y 

5.A« Purification of specificIgGs 

2 An immune serum, as prepared in section 3.G., is appHed to a protein A 

: Sepharose Fast Flow column (Pharmacia) equilibrated in 100 mM Tris-HCl (pH 
15 8.0), The resin is washed by applying 10 column volumes of 100 mM Tris-HCl 
y and 10 volumes of 10 mM Tris-HCl (pH 8.0) to the column. IgG antibodies are 
eluted with 0.1 M glycine buffer (pH 3.0) and are collected as 5 mL fractions to 
which is added 0.25 mL 1 M Tris-HCl (pH 8.0). The optical density of the eluate 
is measured at 280 nm and the fractions containing the IgG antibodies are pooled, 
2 0 dialyzed against 50 mM Tris-HCl (pH 8.0) and, if necessary, stored frozen at 
-70X. 
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5.B. Preparation of the column 

An appropriate amount of CNBr-activated Sepharose 4B gel (1 g of dried 
gel provides for approximately 3.5 mL of hydrated gel; gel capacity is from 5 to 10 
mg coupled IgG/mL of gel) manufactured by Pharmacia (17-0430-01) is 
5 suspended in 1 mM HCl buffer and washed with a buchner by adding small 

quantities of 1 mM HCl buffer. The total volume of buffer is 200 mL per gram of 
gel. 

Purified IgG antibodies are dialyzed for 4 hours at 20±5''C against 
50 volumes of 500 mM sodium phosphate buffer (pH 7.5). The antibodies are then 

10 diluted in 500 mM phosphate buffer (pH 7.5) to a final concentration of 3 mg/mL. 
IgG antibodies are mixed with the gel ovemight at 5±3 °C. The gel is 
packed into a chromatography column and is washed with 2 column volumes of 
500 mM phosphate buffer (pH 7.5), and 1 column volume of 50 mM sodium 
phosphate buffer, containing 500 mM NaCl (pH 7.5). The gel is then transferred 

15 to a tube, mixed with 100 mM ethanolamine (pH 7.5) for 4 hours at room 

temperature, and washed twice with 2 column volumes of PBS. The gel is then 
stored in 1/10,000 PBS merthiolate. The amount of IgG antibodies coupled to the 
gel is determined by measuring the optical density (OD) at 280 nm of the IgG 
solution and the direct eluate, plus washings. 

20 5,C. Adsorption and elution of the antigen 

An antigen solution in 50 mM Tris-HCl (pH 8.0), 2 mM EDTA, for 
example, the supematant obtained in 3.E.L or the solubilized pellet obtained in 
3.E.2., after centrifugation and filtration through a 0.45 ^m membrane, is applied 
to a column equilibrated with 50 mM Tris-HCl (pH 8.0), 2 mM EDTA, at a flow 
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rate of about 10 mL/hour. The column is then washed with 20 volumes of 50 mM 
Tris-HCl (pH 8.0), 2 mM EDTA. Altematively, adsorption can be achieved by 
mixing ovemight at 5^3*^0. 

The adsorbed gel is washed with 2 to 6 volumes of 10 mM sodium 
phosphate buffer (pH 6.8) and the antigen is eluted with 100 mM glycine buffer 
(pH 2.5). The eluate is recovered in 3 mL fractions, to each of which is added 150 
/^L of 1 M sodium phosphate buffer (pH 8.0). Absorption is measured at 280 nm 
for each fraction; those fractions containing the antigen are pooled and stored at - 
20"C. 

* Other embodiments are within the following claims. 
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(i) APPLICANT: Kleanthous, Harold et al. 

(ii) TITLE OF THE INVENTION: Helicobacter GHPO 1360 and 

GHPO 750 Polypeptides and Corresponding Polynucleotides 

(iii) NUMBER OF SEQUENCES: 18 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Clark & Elbing LLP 

(B) STREET: 176 Federal Street 

(C) CITY: Boston 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02110 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE; Ol-APR-1997 

(C) CLASSIFICATION: 



(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Clark, Paul T. 

(B) REGISTRATION NUMBER: 30,175 

(C) REFERENCE/DOCKET NUMBER: 06132/037001 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617-428-0200 

(B) TELEFAX: 617-428-7045 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1149 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 106... 1002 
(D) OTHER INFORMATION: 



10 



(A) NAME/KEY: Signal Sequence 

(B) LOCATION: 106... 166 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 



TTACTCTTTA ATGTGAGTTT TCTGTGTCAT GATAGCTGAT TTTGTTTTAA ATTTGCTATA 60 
ATGTGAATTT AATGATGAAA ATTAGTTTAG AGTGGAGAAC ACACA ATG AAA AAA AAT 117 

Met Lys Lys Asn 
15 -20 

ATC TTA AAT TTA GCG TTA GTG GGT GCG TTG AGC ACG TCG TTT TTG ATG 165 

O lie Leu Asn Leu Ala Leu Val Gly Ala Leu Ser Thr Ser Phe Leu Met 

O -15 -10 -5 

kM 

;yg GCT AAG CCG GCT CAT AAC GCA AAT AAC GCT ACG CAT AAC ACG AAA AAA 213 

M- 20 Ala Lys Pro Ala His Asn Ala Asn Asn Ala Thr His Asn Thr Lys Lys 

m ^ 5 10 15 

^ ACG ACT GAT TCT TCA GCA GGC GTG TTA GCG ACA GTG GAT GGC AGA CCT 261 

Q Thr Thr Asp Ser Ser Ala Gly Val Leu Ala Thr Val Asp Gly Arg Pro 

20 25 30 

~- 25 ATC ACT AAA AGC GAT TTT GAC ATG ATT AAG CAA CGA AAT CCT AAT TTT 309 
lie Thr Lys Ser Asp Phe Asp Met lie Lys Gin Arg Asn Pro Asn Phe 
35 40 45 

GAT TTT GAC AAG CTT AAA GAG AAA GAA AAA GAA GCC TTG ATT GAT CAA 357 
Asp Phe Asp Lys Leu Lys Glu Lys Glu Lys Glu Ala Leu He Asp Gin 
30 50 55 60 

GCT ATT CGC ACC GCC CTT GTA GAA AAT GAA GCT AAA ACC GAG AAA TTG 405 
Ala He Arg Thr Ala Leu Val Glu Asn Glu Ala Lys Thr Glu Lys Leu 
65 70 75 80 

GAC AGC ACT CCA GAA TTT AAA GCG ATG ATG GAA GCG GTT AAA AAA CAG 453 
35 Asp Ser Thr Pro Glu Phe Lys Ala Met Met Glu Ala Val Lys Lys Gin 

85 90 95 

GCT TTA GTG GAA TTT TGG GCT AAA AAA CAG GCT GAA GAA GTG AAA AAA 501 
Ala Leu Val Glu Phe Trp Ala Lys Lys Gin Ala Glu Glu Val Lys Lys 
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100 105 110 

GTC CAA ATC CCA GAA AAA GAA ATG CAA GAT TTT TAC AAC GCT AAC AAA 549 
Val Gin lie Pro Glu Lys Glu Met Gin Asp Phe Tyr Asn Ala Asn Lys 
115 120 125 

5 GAT CAG CTT TTT GTC AAG CAA GAA GCC CAT GCT AGG CAT ATT TTA GTG 597 
Asp Gin Leu Phe Val Lys Gin Glu Ala His Ala Arg His lie Leu Val 
130 135 140 

AAA ACC GAA GAT GAG GCT AAA CGG ATT ATT TCT GAG ATT GAC AAA CAG 645 
Lys Thr Glu Asp Glu Ala Lys Arg He He Ser Glu He Asp Lys Gin 
10 145 150 155 160 

CCA AAG GCT AAA AAA GAA GCT AAA TTC ATT GAG TTA GCC AAT CGG GAT 693 
Pro Lys Ala Lys Lys Glu Ala Lys Phe He Glu Leu Ala Asn Arg Asp 
165 170 175 

ACG ATT GAT CCT AAC AGC AAG AAC GCG CAA AAT GGC GGT GAT TTG GGG 741 
15 Thr He Asp Pro Asn Ser Lys Asn Ala Gin Asn Gly Gly Asp Leu Gly 
180 185 190 

AAA TTC CAA AAG AAC CAA ATG GCT CCG GAT TTT TCT AAA GCC GCT TTC 789 
UJ Lys Phe Gin Lys Asn Gin Met Ala Pro Asp Phe Ser Lys Ala Ala Phe 
^ 195 200 205 

IH 20 GCT TTA ACT CCT GGG GAT TAC ACT AAA ACC CCT GTT AAA ACA GAG TTT 837 
iy Ala Leu Thr Pro Gly Asp Tyr Thr Lys Thr Pro Val Lys Thr Glu Phe 
210 215 220 

jl GGT TAT CAT ATT ATC TAT TTG ATT TCT AAA GAT AGC CCT GTA ACT TAT 885 

Gly Tyr His He He Tyr Leu He Ser Lys Asp Ser Pro Val Thr Tyr 
_ 25 225 230 235 240 

ACT TAT GAA CAG GCT AAA CCT ACC ATT AAG GGG ATG TTA CAA GAA AAG 933 
Thr Tyr Glu Gin Ala Lys Pro Thr He Lys Gly Met Leu Gin Glu Lys 
245 250 255 

CTT TTC CAA GAA CGC ATG AAT CAA CGC ATT GAG GAA CTA AGA AAG CAC 981 
3 0 Leu Phe Gin Glu Arg Met Asn Gin Arg He Glu Glu Leu Arg Lys His 
260 265 270 

GCT AAA ATT GTT ATC AAC AAG TAATTGATGA GGTGTTATCA TGTTAGTTAA AGGC 1036 
Ala Lys He Val He Asn Lys 
275 

35 AATGAAATTT TATTGAAAGC CCATAAAGAA GGTTATGGGG TGGGGGCGTT TAATTTCGTG 1096 
AATTTTGAAA TGCTAAACGC TATTTTTGAA GCAGGAAATG AGGAAAATTC CCC 1149 

(2) INFORMATION FOR SEQ ID N0:2: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 299 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 
(ix) FEATURE: 

(A) NAME/KEY: Signal Sequence 
10 (B) LOCATION: 1...20 

(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Lys Lys Asn He Leu Asn Leu Ala Leu Val Gly Ala Leu Ser Thr 
15 -20 -15 -10 ~5 

Ser Phe Leu Met Ala Lys Pro Ala His Asn Ala Asn Asn Ala Thr His 
2 1 5 10 

Asn Thr Lys Lys Thr Thr Asp Ser Ser Ala Gly Val Leu Ala Thr Val 
F; 15 20 25 

& 20 Asp Gly Arg Pro He Thr Lys Ser Asp Phe Asp Met He Lys Gin Arg 
m 30 35 40 

Asn Pro Asn Phe Asp Phe Asp Lys Leu Lys Glu Lys Glu Lys Glu Ala 
10 45 50 55 60 

W Leu He Asp Gin Ala He Arg Thr Ala Leu Val Glu Asn Glu Ala Lys 
^ 25 65 70 75 

O Thr Glu Lys Leu Asp Ser Thr Pro Glu Phe Lys Ala Met Met Glu Ala 

80 85 90 

Val Lys Lys Gin Ala Leu Val Glu Phe Trp Ala Lys Lys Gin Ala Glu 
95 100 105 

30 Glu Val Lys Lys Val Gin He Pro Glu Lys Glu Met Gin Asp Phe Tyr 
;:j 110 115 120 

Asn Ala Asn Lys Asp Gin Leu Phe Val Lys Gin Glu Ala His Ala Arg 
125 130 135 140 

His He Leu Val Lys Thr Glu Asp Glu Ala Lys Arg He He Ser Glu 
35 145 150 155 

He Asp Lys Gin Pro Lys Ala Lys Lys Glu Ala Lys Phe He Glu Leu 

160 165 170 

Ala Asn Arg Asp Thr He Asp Pro Asn Ser Lys Asn Ala Gin Asn Gly 
175 180 185 

40 Gly Asp Leu Gly Lys Phe Gin Lys Asn Gin Met Ala Pro Asp Phe Ser 
190 195 200 

Lys Ala Ala Phe Ala Leu Thr Pro Gly Asp Tyr Thr Lys Thr Pro Val 
205 210 215 220 

Lys Thr Glu Phe Gly Tyr His He He Tyr Leu He Ser Lys Asp Ser 
45 225 230 235 

Pro Val Thr Tyr Thr Tyr Glu Gin Ala Lys Pro Thr He Lys Gly Met 
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240 245 250 

Leu Gin Glu Lys Leu Phe Gin Glu Arg Met Asn Gin Arg He Glu Glu 

255 260 265 

Leu Arg Lys His Ala Lys He Val He Asn Lys 
5 270 275 

(2) INFORMATION FOR SEQ ID N0:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1448 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 
15 (B) LOCATION: 118... 1314 

(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

JJJ CTCTTGAATG GCGATAAGAC AAAAATGTCT TAAATTTTGT GGTAGCATTT AGGAATACTT 60 

^ 20 AGGATTTTGT TTAGTATAAT TCTAAAATCC ATTTCAAAAA ATTAAGGAGA AATACAA ATG 120 

1 

O GCA AAA GAA AAG TTT AAC AGA ACT AAG CCG CAT GTT AAT ATT GGA ACC 168 

H Ala Lys Glu Lys Phe Asn Arg Thr Lys Pro His Val Asn He Gly Thr 

□ 25 5 10 15 

□ ATT GGG CAT GTA GAC CAT GGT AAA ACG ACT TTG AGT GCA GCG ATT TCA 216 
fy He Gly His Val Asp His Gly Lys Thr Thr Leu Ser Ala Ala He Ser 

20 25 30 

GCG GTG CTT TCT TTG AAA GGT CTT GCA GAA ATG AAA GAC TAT GAT AAT 264 
30 Ala Val Leu Ser Leu Lys Gly Leu Ala Glu Met Lys Asp Tyr Asp Asn 
35 40 45 

ATT GAT AAC GCC CCT GAA GAA AAA GAA AGA GGG ATC ACT ATC GCT ACT 312 
He Asp Asn Ala Pro Glu Glu Lys Glu Arg Gly He Thr He Ala Thr 
50 55 60 65 

35 TCT CAC ATT GAA TAT GAG ACT GAA AAC AGA CAC TAT GCG CAT GTG GAT 360 
Ser His He Glu Tyr Glu Thr Glu Asn Arg His Tyr Ala His Val Asp 
70 75 80 

TGC CCA GGA CAC GCT GAC TAT GTA AAA AAC ATG ATC ACC GGT GCG GCG 408 
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Cys Pro Gly His Ala Asp Tyr Val Lys Asn Met He Thr Gly Ala Ala 
85 90 95 

CAA ATG GAC GGA GCG ATT TTG GTT GTT TCT GCA GCT GAT GGC CCT ATG 456 
Gin Met Asp Gly Ala He Leu Val Val Ser Ala Ala Asp Gly Pro Met 
100 105 110 

CCT CAA ACT AGG GAG CAT ATC TTA TTG TCT CGT CAA GTA GGC GTG CCT 504 
Pro Gin Thr Arg Glu His He Leu Leu Ser Arg Gin Val Gly Val Pro 
115 120 125 

CAC ATC GTT GTT TTC TTA AAC AAA CAA GAC ATG GTA GAT GAC CAA GAA 552 
His He Val Val Phe Leu Asn Lys Gin Asp Met Val Asp Asp Gin Glu 
130 135 140 145 

TTG TTA GAA CTT GTA GAA ATG GAA GTG CGC GAA TTG TTG AGC GCG TAT 600 
Leu Leu Glu Leu Val Glu Met Glu Val Arg Glu Leu Leu Ser Ala Tyr 
150 155 160 

15 GAA TTT CCT GGC GAT GAC ACT CCT ATC GTA GCG GGT TCA GCT TTA AGA 648 

^ Glu Phe Pro Gly Asp Asp Thr Pro He Val Ala Gly Ser Ala Leu Arg 
y 165 170 175 

W GCT TTA GAA GAA GCA AAG GCT GGT AAT GTG GGT GAA TGG GGT GAA AAA 696 

yj Ala Leu Glu Glu Ala Lys Ala Gly Asn Val Gly Glu Trp Gly Glu Lys 
1^ 20 180 185 190 

Ly GTG CTT AAA CTT ATG GCT GAA GTG GAT GCC TAT ATC CCT ACT CCA GAA 744 

- Val Leu Lys Leu Met Ala Glu Val Asp Ala Tyr He Pro Thr Pro Glu 
C3 195 200 205 

p AGA GAC ACT GAA AAA ACT TTC TTG ATG CCG GTT GAA GAT GTG TTC TCT 792 

hi 25 Arg Asp Thr Glu Lys Thr Phe Leu Met Pro Val Glu Asp Val Phe Ser 

O 210 215 220 225 

ATT GCG GGT AGA GGG ACT GTG GTT ACA GGT AGG ATT GAA AGA GGC GTG 840 
He Ala Gly Arg Gly Thr Val Val Thr Gly Arg He Glu Arg Gly Val 
230 235 240 

30 GTG AAA GTA GGC GAT GAA GTG GAA ATC GTT GGT ATC AGA CCT ACA CAA 888 
Val Lys Val Gly Asp Glu Val Glu He Val Gly He Arg Pro Thr Gin 
245 250 255 

AAA ACG ACT GTA ACC GGT GTA GAA ATG TTT AGG AAA GAG TTG GAA AAA 936 
Lys Thr Thr Val Thr Gly Val Glu Met Phe Arg Lys Glu Leu Glu Lys 
35 260 265 270 

GGT GAA GCC GGC GAT AAT GTG GGC GTG CTT TTG AGA GGA ACT AAA AAA 984 
Gly Glu Ala Gly Asp Asn Val Gly Val Leu Leu Arg Gly Thr Lys Lys 
275 280 285 
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GAA GAA GTG GAA CGC GGT ATG GTT CTA TGC AAA CCA GGT TCT ATC ACT 1032 
Glu Glu Val Glu Arg Gly Met Val Leu Cys Lys Pro Gly Ser lie Thr 
290 295 300 305 

COG CAC AAG AAA TTT GAG GGA GAA ATT TAT GTC CTT TCT AAA GAA GAA 1080 
5 Pro His Lys Lys Phe Glu Gly Glu He Tyr Val Leu Ser Lys Glu Glu 

310 315 320 

GGC GGG AGA CAC ACT CCA TTC TTC ACC AAT TAC CGC CCG CAA TTC TAT 1128 
Gly Gly Arg His Thr Pro Phe Phe Thr Asn Tyr Arg Pro Gin Phe Tyr 
325 330 335 

10 GTG CGC ACA ACT GAT GTG ACT GGC TCT ATC ACC CTT CCT GAA GGC GTA 1176 
Val Arg Thr Thr Asp Val Thr Gly Ser He Thr Leu Pro Glu Gly Val 
340 345 350 

GAA ATG GTT ATG CCT GGC GAT AAT GTG AAA ATC ACT GTA GAG TTG ATT 1224 
Glu Met Val Met Pro Gly Asp Asn Val Lys lie Thr Val Glu Leu He 
15 355 360 365 

AGC CCT GTT GCG TTA GAG TTG GGA ACT AAA TTT GCG ATT CGT GAA GGC 1272 
Ser Pro Val Ala Leu Glu Leu Gly Thr Lys Phe Ala He Arg Glu Gly 
370 375 380 385 

GGT AGG ACC GTT GGT GCT GGT GTT GTG AGC AAT ATT ATT GAA TAATATTAG 1323 
20 Gly Arg Thr Val Gly Ala Gly Val Val Ser Asn He He Glu 

390 395 

CAAAAAGAGA GTTACCATAA AGGGTCATTA TGAAAGTTAA AATAGGGTTG AAGTGTTCTG 1383 
ATTGTGAAGA TATCAATTAC AGCACAACCA AGAACGCTAA AACTAACACT GAAAAACTGG 1443 
AGCTT 1448 

25 (2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 399 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
3 0 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Met Ala Lys Glu Lys Phe Asn Arg Thr Lys Pro His Val Asn He Gly 
35 1 5 10 15 

Thr He Gly His Val Asp His Gly Lys Thr Thr Leu Ser Ala Ala He 

20 25 30 

Ser Ala Val Leu Ser Leu Lys Gly Leu Ala Glu Met Lys Asp Tyr Asp 
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35 40 45 

Asn He Asp Asn Ala Pro Glu Glu Lys Glu Arg Gly S.e Thr He Ala 

50 55 60 

Thr Ser His He Glu Tyr Glu Thr Glu Asn Arg His Tyr Ala His Val 
5 65 70 75 80 

Asp Cys Pro Gly His Ala Asp Tyr Val Lys Asn Met He Thr Gly Ala 

85 90 95 

Ala Gin Met Asp Gly Ala He Leu Val Val Ser Ala Ala Asp Gly Pro 
100 105 110 

10 Met Pro Gin Thr Arg Glu His He Leu Leu Ser Arg Gin Val Gly Val 
115 120 125 

Pro His He Val Val Phe Leu Asn Lys Gin Asp Met Val Asp Asp Gin 

130 135 140 

Glu Leu Leu Glu Leu Val Glu Met Glu Val Arg Glu Leu Leu Ser Ala 
15 145 150 155 160 

Tyr Glu Phe Pro Gly Asp Asp Thr Pro He Val Ala Gly Ser Ala Leu 

165 170 175 

Arg Ala Leu Glu Glu Ala Lys Ala Gly Asn Val Gly Glu Trp Gly Glu 
180 185 190 

20 Lys Val Leu Lys Leu Met Ala Glu Val Asp Ala Tyr He Pro Thr Pro 
195 200 205 

Glu Arg Asp Thr Glu Lys Thr Phe Leu Met Pro Val Glu Asp Val Phe 

210 215 220 

Ser He Ala Gly Arg Gly Thr Val Val Thr Gly Arg He Glu Arg Gly 
25 225 230 235 240 

Val Val Lys Val Gly Asp Glu Val Glu He Val Gly He Arg Pro Thr 

245 250 255 

Gin Lys Thr Thr Val Thr Gly Val Glu Met Phe Arg Lys Glu Leu Glu 
260 265 270 

30 Lys Gly Glu Ala Gly Asp Asn Val Gly Val Leu Leu Arg Gly Thr Lys 
275 280 285 

Lys Glu Glu Val Glu Arg Gly Met Val Leu Cys Lys Pro Gly Ser He 

290 295 300 

Thr Pro His Lys Lys Phe Glu Gly Glu He Tyr Val Leu Ser Lys Glu 
35 305 310 315 320 

Glu Gly Gly Arg His Thr Pro Phe Phe Thr Asn Tyr Arg Pro Gin Phe 

325 330 335 

Tyr Val Arg Thr Thr Asp Val Thr Gly Ser He Thr Leu Pro Glu Gly 
340 345 350 

40 Val Glu Met Val Met Pro Gly Asp Asn Val Lys He Thr Val Glu Leu 
355 360 365 

He Ser Pro Val Ala Leu Glu Leu Gly Thr Lys Phe Ala He Arg Glu 

370 375 380 

Gly Gly Arg Thr Val Gly Ala Gly Val Val Ser Asn He He Glu 
45 385 390 395 

(2) INFORMATION FOR SEQ ID N0:5: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 32 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
5 CGCGGATCCG AATGAAAAAA AATATCTTAA AT 32 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CCGCTCGAGT TACTTGTTGA TAACAATTTT 30 
(2) INFORMATION FOR SEQ ID NO: 7: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CGCGGATCCG AATGGCAAAA GAAAAGTTTA AC 32 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CCGCTCGAGT TATTCAATAA TATTGCTCAC 30 

3 0 (2) INFORMATION FOR SEQ ID NO : 9 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Lys Glu Lys Phe Asn Arg Thr Lys Pro His Val Asn He Gly Thr 
15 10 15 

10 He Gly His Val Asp His 
20 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 13 amino acids 

15 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

20 Ala His Asn Ala Asn Asn Ala Thr His Asn Thr Lys Lys 
15 10 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids 

25 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:11: 

3 0 Lys Pro Ala His Asn Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 9 amino acids 
35 (B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

5 lie Asp Lys Gin Pro Lys Ala Lys Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 8 amino acids 

10 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

15 Phe Trp Ala Lys Lys Gin Ala Glu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GTGGAGAACA CACAATGAAA AAAAATATC 29 

25 (2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15! 
GCTAATATTA TTCAATAATA TTGCTCACAA C 



(2) INFORMATION FOR SEQ ID NO: 16: 



31 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
5 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GGAGAAATAC AAATGGCAAA AGAAAAG 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
:,15 GCTAATATTA TTCAATAATA TTGCTCACAA C 31 

= (2) INFORMATION FOR SEQ ID NO: 18: 



-n (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 
ffi (B) TYPE: nucleic acid 

1^20 (C) STRANDEDNESS: single 

7 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CATAACGCAA ATAACGCTAC GCAT 
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